Preface


The idea for this book comes from my experience teaching physics and 
especially particle physics at the university level for more than 25 years. I 
noticed that historical background to physics livens up the subject, increases 
student interest by giving a human touch to an otherwise rather abstract 
matter. When I stumbled over Steven Weinberg’s book, The Discovery of 
Subatomic Particles [548], which deals with the discovery of protons, neu- 
trons and electrons, I realised that this experience is shared by others. 
When I taught the online courses “Particle Physics — An Introduction” 
(https://www.coursera.org/learn/particle-physics) and its French version
with Mercedes Paniccia and other colleagues, I thought that a book using
the history of particle physics as a guideline would be a good companion. 

As an experimental particle physicist, I did not venture to make an original 
contribution to the history of science. That is why I did not really respect the 
chronology of discovery. Rather, historical facts, personalities and the often 
colourful stories of major discoveries serve as didactic tools to motivate more 
in-depth arguments about physics. In finding my narrative I have greatly prof- 
ited from Helge Kragh’s work Quantum Generations [514] and the chronolog- 
ical bibliography Particle Physics: One Hundred Years of Discoveries [480], 
carefully compiled by Vladimir V. Ezhela and many others in a U.S.-Russian 
collaboration. 

The body of the text uses little mathematics. More technical discussions 
are concentrated in Focus Boxes, which — I hope — give the necessary con- 
densed information to the reader, such that more ample treatment can be 
found elsewhere. Suggestions for this “elsewhere” are given at the end of each 
chapter. I generally refrained from explaining mathematical terms; adequate 
definitions can be found, e.g., on Wikipedia. I also rarely included biographical 
details for the many actors contributing to particle physics in the past hun- 
dred years. For most of them, concise entries exist, e.g., in the Encyclopedia 
Britannica; more elaborate biographies are referenced in the text or included 
in the Further Reading sections of each chapter. 

The reader I had in mind when writing this book is a colleague of mine in 
spirit. Someone who has a background in physics, at least in classical physics, 
and an open mind for unusual concepts and facts. Maybe the reader has not 
followed particle physics for a while and wonders what happened since last 
in contact with academia. Maybe the reader is looking for input in his or 
her own teaching at whatever level, or just intellectual stimulus. In any case,
the reader is someone who, like me, believes that physics should be based
on observing nature, describing the results in mathematical language and 
ultimately condensing it into theory. That the order of these essential steps is 
rather often reversed, is an interesting fact all by itself. 

I am grateful to Mercedes Paniccia from the University of Geneva for carefully
assessing the text from both the didactic and the physics points of view, as
she had already done for the online courses. She eliminated countless errors
and flaws from the manuscript. The many remaining mistakes are, of course, 
all mine. 


Martin Pohl 
Hamburg 


CHAPTER 1 


Introduction 


The whole truth would be an infinite concatenation of mostly irrelevant
facts. ... So we do not tell the whole truth; we tell carefully crafted
stories, and we do this even when our moral purpose is to tell the truth. 


Justin E.H. Smith, Irrationality, 2019 [686]¹


I DO NOT KNOW how you feel, but I have often been bored by the history
of sciences. Who exactly published what detail first does not really fascinate
me. What I am interested in and what will consequently be the subject
of this book is the great steps that brought us to where we are in particle 
physics today. The disruptive ideas, the unexpected experimental results that 
have determined the course of research, and what it took for the scientific 
community to accept and embrace them. 

I am not very interested in the historian’s viewpoint since I am an exper- 
imental particle physicist. I will not project myself (and you) back in time, 
but use today’s knowledge to identify (and sometimes invent) a narrative for 
the progress that brought us to what is called the Standard Model of particle 
physics. I also consistently use terms in their modern definition, often not 
available to physicists at the time. You see, I take the Whig history point of 
view, often criticised by real historians. 

So this is not a book about the history of my science; it is a book about 
particle physics. It uses historical landmarks as a vehicle to give an idea of
how processes of innovation work in physics, instead of just explaining the end 
result. At any rate, the “end result” is never the end of the process, just a 
major step along the way. What is very important to me is the relation between 
theory and experiment, so you will find experimental details not only where 
you expect them. I have nonetheless tried to do justice to historical facts, 
mostly based on secondary literature, but also on the original publications I 
quote. 

I owe you some clarification about my personal philosophy of sciences, even 
though I am far from pretending to be a philosopher. I have a reductionist 


¹ Quoted by permission from Princeton University Press.

model of the progress in physics, which is well represented by the diagram in
Figure 1.1. From the top down, more phenomena are explained by fewer
concepts, laws and parameters, thus reducing their number, not necessarily their
complexity. In not much more than a century we have progressed impressively.
As far as forces are concerned, electromagnetic, weak and strong interactions 
are now described by a common framework, although not completely unified. 
Gravity has been understood as a classical field theory, however with no quan- 
tum theory to describe its workings at small distances. Two layers of matter 
constituents have been identified, from the atom to electrons, protons and 
neutrons, and further on to six quarks and six leptons which may well be 
truly elementary. And the description of matter and forces has been unified 
in the quantum field theory of the Standard Model. Finally, mass has been 
understood to have a dynamical origin. I will sketch the fascinating concepts 
behind this progress in this book. 


[Figure 1.1, diagram. Top row, from left to right: Celestial Gravity, Terrestrial Gravity, Nuclear Force, Weak Force, Electricity, Magnetism, Condensed Matter, Gaseous Matter. Intermediate levels: Universal Gravity, General Relativity, Electromagnetism, Quantum Electrodynamics, Quantum Chromodynamics, Electroweak Theory, Atoms, Quantum Matter, Quarks and Leptons, Standard Model, Symmetry Breaking, Higgs. Elements marked "???": Quantum Gravity, Dark Energy, Dark Matter, Symmetry, "Final Theory".]

Figure 1.1 A reductionist view of the development of particle physics. Forces 
are represented to the left, matter to the right of the diagram. Missing elements 
known today are indicated by triple question marks. 


Ideally, but definitely not in our lifetime and probably never, this reduction
process would culminate in a truly unified theory of the Universe,
including elements dominating its evolution at cosmic scales. The impossible
dream behind particle physics, to understand the Universe by its microscopic
properties, would then have come true. As Einstein put it [289]²:


...from the beginning there has always been the attempt to 
find a unifying theoretical basis for all these single sciences, con- 
sisting of a minimum of concepts and fundamental relationships, 
from which all concepts and relationships of the single disciplines 
might be derived by logical process. This is what we mean by the 
search for a foundation of the whole of physics. The confident belief 
that this ultimate goal may be reached is the chief source of the 
passionate devotion which has always animated the researcher. 


At the very end, in Chapter 10, I will critically assess this approach as a 
guideline for further progress. 

I will analyse events of Standard Model history from the standpoint of 
one of its victims. I learned much of physics from the “Feynman Lectures on 
Physics” and got my degree when weak neutral current interactions were dis- 
covered in the laboratory I was part of. As an experimental particle physicist 
in the late 20th and early 21st century, I have been running after Standard 
Model theoretical predictions for most of my professional life, trying in vain 
to find a breach in its fortifications. That causes some kind of fatalism in my 
view of its past and future, a bias that you should be aware of when critically 
assessing what I write here. I am a physicist by training so I cannot refrain 
from using the way physicists think. As one of my teachers used to say:
the truth is in the formulae, not in the blabla. I will nevertheless relegate most 
“technical” detail to Focus Boxes that should explain it to the necessary level 
and which you can skip if you are familiar with, say, classical electrodynamics, 
special relativity, quantum mechanics or quantum field theory. I suspect that
you will find it easiest to follow my arguments if you have a basic science
education, such as a major or a bachelor’s degree in a “hard” science subject.

Before we start to talk about the subjects of this book, I will remind you 
about elements of classical physics, which are necessary to understand what 
follows, like Newton’s and Maxwell’s laws and the concept of ether. If you 
are familiar with those, I invite you to skip section 2.1 and to follow me to 
what I took to be my starting point: the discovery of the electron by Walter 
Kaufmann, Emil Wiechert and most prominently Joseph John Thomson. 


FURTHER READING 


Richard P. Feynman, Robert B. Leighton and Matthew Sands, The Feynman 
Lectures on Physics: The New Millennium Edition, Basic Books, 2011. 


² Quoted by permission from Science Magazine.

CHAPTER 2


The first particles 


As in Mathematicks, so in Natural Philosophy, the Investigation 
of difficult Things by the Method of Analysis, ought ever to precede 
the Method of Composition. This Analysis consists in making Experi- 
ments and Observations, and in drawing general Conclusions from them 
by Induction, and admitting of no Objections against the Conclusions, 
but such as are taken from Experiments, or other certain Truths. For 
Hypotheses are not to be regarded in experimental Philosophy. And 
although the arguing from Experiments and Observations by Induction 
be no Demonstration of general Conclusions; yet it is the best way of 
arguing which the Nature of Things admits of, and may be looked upon 
as so much the stronger, by how much the Induction is more general. 


Isaac Newton, Opticks, 1704 [2] 


THE INDUSTRIAL REVOLUTION, set off by the development of practical
steam engines in the 18th century, moved science, and especially
physics, from a curiosity for the aristocracy (used for amusement, warfare, or
transmutation of elements to finance their follies) into the realm of practical
applications pervading everyday life. This transition was also fostered by
the formation of scientific and physical societies in Europe and the USA, like
DPG (Germany) in 1845, SFP (France) in 1873, IoP (Britain) in 1874 and
APS (USA) in 1899. These were more open to the outside than the academies
of science founded in the 18th century and served as platforms for discussion
and more rapid dissemination of new physics. A good indicator for these
dynamics is the wave of polytechnic schools that were created in the 19th century,
starting with the Ecole polytechnique in Paris founded during the French
Revolution in 1794, and followed by engineering schools all over Europe and in
the USA, on public as well as private initiative. They quickly rose to high academic
standards. Examples are ETH Zürich (1855), MIT Cambridge (1861),
RWTH Aachen (1879) and Caltech Pasadena (1891). An exception is Britain
where the training of engineers stayed with industry for a long period.

Physicists in the 19th century, and scientists in general, felt deeply embedded
in general culture. To German physicists in particular, making and
appreciating music was very important [562]. This stayed so well into
the 20th century¹.

This first chapter sets the scene for the start of discoveries in particle 
physics by introducing necessary scientific ingredients: the understanding of 
motion and the mastering of electromagnetism. The former describes how 
forces change the movement of objects according to classical physics, with 
differential equations and their application in Newton’s law. For the latter, 
Maxwell’s laws completed classical electrodynamics with its enormous economic
and societal impact. That impact had already begun with the revolution
of communication by the electric telegraph in the 1830s, the introduction of 
commercial electric light in the 1870s and pioneering work on wireless commu- 
nication by Guglielmo Marconi and Karl Ferdinand Braun in the 1890s. I will 
discuss experiments with cathode rays in the 1890s by Wiechert, Kaufmann 
and most prominently Joseph John Thomson, which led to the discovery of 
the electron, the first matter particle in modern terms. Millikan’s experiment 
on charge quantisation, although later in time, will also be described in this 
context. 


2.1 GIANTS’ SHOULDERS 


The aim of classical physics is to predict the evolution of a system of objects, 
when the initial conditions of all ingredients and the forces that act upon them 
are known. The idealised object of classical physics is the mass point, an object 
with no extension, but with properties like mass and electrical charge. Initial 
conditions are specified as the spatial location and velocities of all objects at 
some initial time, say t = 0. The positions and velocities of each object at 
any other time are then determined by Newton’s law [1], as specified in Focus 
Box 2.1. The law is named after Isaac Newton, the father of classical physics. 
Newton’s law is a differential equation linking the acceleration of a mass point, 
i.e. its infinitesimal change of velocity per infinitesimal time period, to the 
forces that act at the place and time where the mass point is. Acceleration 
and force are in fact proportional to each other, the proportionality factor is 
the mass. It thus takes twice as much force to give the same acceleration to 
an object which is twice as massive. 

Position r(t), velocity v(t) and acceleration are vectors in three-dimensional
Euclidean space. The coordinate system has three axes that a
Cartesian will choose as orthogonal. Origin and direction of the axes are arbitrary.
Vectors specify a value as well as a direction in the chosen coordinate
system. Consequently, the force is also a vector, with value and direction. If
there are several forces, their vector sum determines the acceleration. You may 
remember the parallelogram of forces: it is the two dimensional representation 
of the vector sum. 


¹ For a well-known example see J. Weidman, The Night I Met Einstein, Reader’s Digest,
November 1955.

In classical physics, forces determine how objects move. If there are none,
the acceleration is zero; the object moves at constant velocity, equal to its 
initial value, and in a straight line along its initial direction. If a force acts, 
the velocity changes in value and/or direction. If the force is always aligned 
with the object’s velocity, only the speed will change. Otherwise speed and 
direction will change. A special case is a force always orthogonal to the object’s
velocity. Such a force will only change the direction of the velocity, but leave
the speed constant. See Focus Box 2.1 for examples of these special
cases.


Newton’s law specifies how the momentum $\vec{p}(t)$ of an object changes under
the influence of a force $\vec{F}$:

$$\frac{d\vec{p}(t)}{dt} = \vec{F}$$

The momentum of a mass point is the product of its mass m and velocity $\vec{v}(t)$
(see Focus Box 2.2). Velocity is the rate of change of position $\vec{r}(t)$ per time
t, $\vec{v} = d\vec{r}/dt$. Thus if the mass is constant, the rate of change of momentum
is $m\, d^2\vec{r}/dt^2 = \vec{F}$. The acceleration of a mass point, i.e. the rate of change of
its velocity, $d^2\vec{r}/dt^2$, is proportional to the applied force; the proportionality
factor is its mass.

If there is no force at all, the momentum of the mass point remains constant. 
If there is a force F , but it has no component along a given axis, the 
momentum component along this axis stays unchanged. Otherwise, the force 
component causes acceleration in that direction. Mind that depending on the 
sign, acceleration may also be reducing the speed. 


A first example is the action of a force constant in space and time, such as
the electric force on a point-like charge inside a parallel plate capacitor (see
Focus Box 2.5). This will lead to a constant acceleration in the direction of
the force.

As a second example let us consider a force $\vec{F}$ constant in value and
always orthogonal to the velocity $\vec{v}$, such as the force on a charged mass
point moving in a constant magnetic field (see Focus Box 2.5). In this case,
since $\vec{F} \perp \vec{v}$ at all times, the speed $|\vec{v}|$ does not change, but the
direction of the velocity changes at a constant rate. The force then constrains
the mass point to move on a circular path with radius $R = mv^2/F$ at constant
angular velocity $\omega = v/R$ in the plane defined by the force and the initial
velocity vector.


If the mass point had an initial velocity component orthogonal to that plane, 
it will stay constant, leading to a helical movement in three dimensions. 


Focus Box 2.1: Newton’s law 
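
The second example of the Focus Box can be checked numerically. The following is a minimal sketch, not part of the original text: it assumes arbitrary illustrative values for m, v and F, restricts the motion to a plane, and uses a standard fourth-order Runge-Kutta step (the function name `simulate_orthogonal_force` is hypothetical). It integrates the motion under a force of constant magnitude that stays orthogonal to the velocity and compares the result with the radius R = mv²/F.

```python
import math

def simulate_orthogonal_force(m, v0, F, t_end, n_steps):
    """Integrate m dv/dt = F * n in 2D, where n is the unit vector
    orthogonal to the velocity, with fourth-order Runge-Kutta steps."""
    def deriv(state):
        x, y, vx, vy = state
        speed = math.hypot(vx, vy)
        # Unit vector orthogonal to the velocity (velocity rotated by +90 degrees)
        nx, ny = -vy / speed, vx / speed
        return (vx, vy, F * nx / m, F * ny / m)

    def shifted(state, k, h):
        return tuple(s + h * ki for s, ki in zip(state, k))

    state = (0.0, 0.0, v0, 0.0)   # start at the origin, moving along +x
    dt = t_end / n_steps
    path = [state]
    for _ in range(n_steps):
        k1 = deriv(state)
        k2 = deriv(shifted(state, k1, dt / 2))
        k3 = deriv(shifted(state, k2, dt / 2))
        k4 = deriv(shifted(state, k3, dt))
        state = tuple(
            s + dt / 6 * (a + 2 * b + 2 * c + d)
            for s, a, b, c, d in zip(state, k1, k2, k3, k4)
        )
        path.append(state)
    return path

# Arbitrary illustrative values in SI units (assumptions, not from the text)
m, v0, F = 2.0, 3.0, 6.0
R = m * v0**2 / F              # predicted radius of the circle
omega = v0 / R                 # predicted angular velocity
period = 2 * math.pi / omega   # time for one full revolution

path = simulate_orthogonal_force(m, v0, F, period, 2000)
speeds = [math.hypot(vx, vy) for _, _, vx, vy in path]
# With the initial force along +y, the centre of the circle lies at (0, R)
radii = [math.hypot(x, y - R) for x, y, _, _ in path]

print("predicted radius:", R)
print("largest speed deviation:", max(abs(s - v0) for s in speeds))
print("largest radius deviation:", max(abs(r - R) for r in radii))
```

Within the accuracy of the integration, the speed stays constant and the mass point returns to its starting position after one period.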
Since Newton’s law is an ordinary differential equation (i.e. the position
of a mass point does not depend on any variable other than time), it can
simply be integrated over time provided we know an analytical expression
for the force as a function of time and space. A first integration will deliver
the velocity vector as a function of time; on the left side of the equation we
thus get the momentum of the mass point (see Focus Boxes 2.1 and 2.2). A
second integration will give us the position of the object as a function of time. 
Each time we integrate, a constant needs to be fixed. The initial conditions of 
the system specify them for us, i.e. fix the initial position and velocity of the 
object. 
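
The two integrations can be illustrated with a few lines of code. This is a rough sketch under simplifying assumptions that are mine, not the text's: one spatial dimension, a constant force, arbitrary numerical values, and a simple step-by-step summation in place of the analytical integral (the function name `integrate_constant_force` is hypothetical). The initial position and velocity play the role of the two integration constants.

```python
def integrate_constant_force(m, F, r0, v0, t_end, n_steps):
    """Step-by-step integration of m dv/dt = F in one dimension."""
    dt = t_end / n_steps
    r, v = r0, v0                # initial conditions fix the two constants
    for _ in range(n_steps):
        v += F / m * dt          # first integration: velocity
        r += v * dt              # second integration: position
    return r, v

# Arbitrary illustrative values in SI units (assumptions)
m, F = 1.5, 3.0
r0, v0 = 1.0, -2.0
t = 4.0

r_num, v_num = integrate_constant_force(m, F, r0, v0, t, 100_000)

# Analytical result for a constant force
a = F / m
v_exact = v0 + a * t
r_exact = r0 + v0 * t + 0.5 * a * t**2

print("velocity: numerical", v_num, "analytical", v_exact)
print("position: numerical", r_num, "analytical", r_exact)
```

The numerical position approaches the analytical one as the number of steps grows; a different choice of r0 or v0 shifts the whole trajectory without changing the acceleration.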

We thus have a law of motion where forces act continuously and inevitably 
on the objects, as sketched in Fig. 2.1. There is no option: if a force can act 
on an object, it will. And it will do so wherever it reigns. However, not all 
forces act on all objects alike. For example, the gravitational force acts if the 
object has mass, the electromagnetic force if it has electric charge. 

Classical physics thus predicts what is called a trajectory for our mass 
points. A trajectory is a smooth curve in space as a function of time, since 
it is determined by a differential equation. And it is predicted accurately 
and reliably by Newton’s mechanics, otherwise you could not drive your car 
responsibly. That remains so as long as all distances are macroscopic, i.e. large 
compared to the size of an atom, about 50 pm for hydrogen², as an example.
And as long as its velocity stays far below the speed of light, about 300,000 
km per second. So, in your car you are safe, at least as far as these conditions 
are concerned. 


[Diagram: a smooth curve leading from the initial state to the final state.]


Figure 2.1 The classical action of a force on the movement of a mass point. 
The trajectory is a smooth curve as a function of time. 


Important abstract characteristics of motion developed by classical physics 
are the concepts of energy and momentum, as explained in Focus Box 2.2. Gottfried
Wilhelm von Leibniz was probably the first in the 1680s to use a mathematical
formulation of energy close to its modern sense, calling it vis viva, a
name which stuck for a long time. Thomas Young in the 19th century coined
the term energy. Julius Robert von Mayer, James Prescott Joule, Hermann von 
Helmholtz and William Thomson (Lord Kelvin) established the conservation 


² A picometer (pm) is $10^{-12}$ m, one millionth of a millionth of a meter.

of energy and its role in physics: Energy is a condition that describes the
capacity of a system to do work. It cannot be created or destroyed, but it can 
be transformed from one form to another. 

The formulation of momentum as a characteristic of motion took longer. 
In the 17th century René Descartes and Galileo Galilei introduced a quantity 
proportional to size and speed, which they called quantitas motus and impeto, 
respectively. The first term was retained by Isaac Newton in his Principia 
Mathematica [1], as a constant of motion which can only be changed by a 
force. 

Apart from heat, the phenomena that classical physics is mostly concerned 
with are the gravitational and the electromagnetic interactions. Both can be 
described by an action-at-a-distance law as well as a local one. The latter 
involves the notion of the classical force field and thus deserves our special 
attention. Let us start with the gravitational force. Newton’s action-at-a- 
distance law of the universal gravitational force between a mass M at location 
$\vec{0}$ and another mass m at $\vec{r}$ is:

$$\vec{F} = -G\,\frac{mM}{r^2}\,\hat{r}$$


The gravitational constant G determines the order of magnitude of the force, 
its strength. The masses m and M of the two objects are both the source and 
the object of the interaction; they are interchangeable. The force diminishes 
quadratically with the distance $r = |\vec{r}|$ between the two objects; when the
distance is doubled, the force is four times less. The vector $\hat{r}$ of unit length
points from the location of M (the origin of our coordinates) towards where m
is. The minus sign thus indicates that we are dealing with an attractive force.
Had we put m at the origin instead, the direction of $\hat{r}$ would reverse, but
the strength of the force would be the same. The force thus only depends on
the relative position of the two masses. We rightly talk about an inter-action: 
the force is completely symmetric, source and object of the force cannot be 
distinguished. 

Thus the Moon attracts the Earth with the same force as the one by 
which the Earth attracts the Moon. But the consequences are quite different. 
Since the Earth’s mass is about 80 times larger than that of the Moon, it is 
accelerated 80 times less by the same force. Thus we can usually assume that 
the Moon rotates around the Earth centre, while in reality both rotate around 
a common centre, displaced from the centre of the Earth by about 3/4 of the 
Earth radius. 
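
The numbers in this paragraph can be verified with a short calculation. The sketch below uses approximate, commonly quoted values for the masses, the mean Earth-Moon distance and the mean Earth radius; these constants are my assumptions and are not taken from the text.

```python
# Approximate reference values (assumptions for illustration)
M_EARTH = 5.972e24    # kg
M_MOON = 7.342e22     # kg
DISTANCE = 3.844e8    # m, mean centre-to-centre distance
R_EARTH = 6.371e6     # m, mean radius

# Equal forces on unequal masses: accelerations scale with the inverse mass ratio
mass_ratio = M_EARTH / M_MOON

# Distance of the common centre of mass from the centre of the Earth
barycentre = DISTANCE * M_MOON / (M_EARTH + M_MOON)
fraction_of_radius = barycentre / R_EARTH

print("mass ratio Earth/Moon:", round(mass_ratio, 1))
print("barycentre distance in km:", round(barycentre / 1000))
print("fraction of Earth radius:", round(fraction_of_radius, 2))
```

With these inputs the mass ratio comes out close to 80 and the common centre lies at roughly three quarters of the Earth radius from the Earth centre, i.e. still inside the Earth.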

With this law we know how large the gravitational force is and in which 
direction it acts. But how does an object know that the other is there? A 
first step towards the answer is the classical gravitational field. A field is a 
physical quantity attributed to a point in space, or in space and time as we 
will see later. This can be a scalar quantity such as temperature. When we
map out the temperatures in a room, we will probably find that they are higher
close to a heat source (because of radiation) and close to the ceiling compared
Kinetic energy $E_{kin}$, proportional to mass m and square of velocity v of an
object, is the motor of motion:

$$E_{kin} = \frac{1}{2} m v^2$$

One injects energy into a system by doing work on it. Working against a force
$\vec{F}$ by transporting an object along a trajectory $\vec{r}$ increases its potential
energy, $E_{pot}$:

$$E_{pot} = -\int \vec{F} \cdot d\vec{r}$$


It is “potential”, because it can be stored by holding and realised by releasing
the object. It is also called potential because it can be seen as an energy
associated with position inside a potential $\Phi$, if the force results from such a
potential, i.e. $\vec{F} = -\nabla\Phi$. The force is then called a conservative force. This
term was coined by William Rankine in 1853 [20].

The importance of momentum in characterising classical motion took a long 
time to be realised. Several attempts to construct a “quantity of motion” 
finally resulted in Isaac Newton’s conservation law for the product of mass 
and velocity, p = mv. It does not change if there is no force acting (see Focus 
Box 2.1). But a force can change its absolute value, its direction, or both, 
depending on the angle between force and momentum. 

Indeed the importance of energy and momentum for kinematics is that both 
are strictly conserved quantities in closed systems. In case of energy, one must 
however include all its forms. When all intervening forces result from a poten- 
tial, only kinetic and potential energies occur. The conserved quantity is then 
the sum of the two, $E = E_{kin} + E_{pot}$, but an exchange between the two remains
possible. Potential energy can be transformed into kinetic energy and vice versa.
Likewise, the total momentum of a closed system is strictly conserved. Thus 
the vector sum of all momenta is a constant. How these conservation laws 
come about is clarified by Noether’s theorem (see Focus Box 9.6). 


Focus Box 2.2: Energy and momentum 
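
The exchange between kinetic and potential energy can be made concrete with a simple case. The sketch below is an illustration under assumptions of my own: a mass dropped from rest in a uniform gravitational field, no friction, and arbitrary values for the mass and the initial height (the function name `energies` is hypothetical). It evaluates both forms of energy at several times and shows that their sum does not change.

```python
G_ACC = 9.81   # m/s^2, assumed uniform gravitational acceleration

def energies(m, z0, t):
    """Kinetic and potential energy of a mass m dropped from rest at height z0."""
    v = -G_ACC * t                    # velocity after time t (downwards)
    z = z0 - 0.5 * G_ACC * t**2       # height after time t
    e_kin = 0.5 * m * v**2
    e_pot = m * G_ACC * z             # potential energy relative to z = 0
    return e_kin, e_pot

# Arbitrary illustrative values (assumptions): 2 kg dropped from 20 m
m, z0 = 2.0, 20.0
times = (0.0, 0.5, 1.0, 1.5, 2.0)

totals = []
for t in times:
    e_kin, e_pot = energies(m, z0, t)
    totals.append(e_kin + e_pot)
    print(f"t = {t:.1f} s  E_kin = {e_kin:7.2f} J  E_pot = {e_pot:7.2f} J")
```

At every instant the potential energy lost equals the kinetic energy gained, so the total stays at its initial value m g z0.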
to the floor (because of convection, unless the floor itself is the heat source).
Interpolating the map, we can probably define an analytic function T(r) that 
describes it, as a function of position r. The large scale numerical equivalent 
is one of the ingredients that weather models use to make predictions. The 
field can also be a vector as in the case of gravitation. Let us treat M as the 
source of a gravitational field g: 


$$\vec{g}(\vec{r}) = -GM\,\frac{\hat{r}}{r^2}$$

You will notice that we have simply taken the field to be $\vec{g}(\vec{r}) = \vec{F}(\vec{r})/m$, but
the approach changes more radically than that. We have now declared M to
be the source of the field and m to be a probe that can be used to measure
its strength. But the field $\vec{g}(\vec{r})$ is independent of the probe; it has become a
property of each point in space. We can then express the gravitational force
as a local interaction between field and probe:

$$\vec{F}(\vec{r}) = m\,\vec{g}(\vec{r})$$
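
The separation between source and probe translates naturally into code. In the sketch below, which is an illustration and not part of the original text, one function computes the field of a point mass at the origin and a second one applies it to a probe. The function names `g_field` and `force_on_probe` are hypothetical, and the values for the gravitational constant and for the Earth are approximate assumptions.

```python
import math

G = 6.674e-11        # gravitational constant in m^3 kg^-1 s^-2 (approximate)
M_EARTH = 5.972e24   # kg (approximate)
R_EARTH = 6.371e6    # m, mean radius (approximate)

def g_field(source_mass, r_vec):
    """Field of a point mass at the origin: -G M r_hat / r^2, as a 3-vector."""
    r = math.sqrt(sum(c * c for c in r_vec))
    # r_hat / r^2 equals r_vec / r^3
    return tuple(-G * source_mass * c / r**3 for c in r_vec)

def force_on_probe(probe_mass, g_vec):
    """Local interaction between field and probe: F = m g."""
    return tuple(probe_mass * c for c in g_vec)

# The field depends on the source and the position only, not on the probe
g_surface = g_field(M_EARTH, (R_EARTH, 0.0, 0.0))
g_double = g_field(M_EARTH, (2 * R_EARTH, 0.0, 0.0))
weight_1kg = force_on_probe(1.0, g_surface)

print("field at the surface (m/s^2):", g_surface)
print("field at twice the distance (m/s^2):", g_double)
print("force on a 1 kg probe (N):", weight_1kg)
```

The field points back towards the source, has a magnitude near the familiar 9.8 m/s² at the Earth's surface, and drops to a quarter at twice the distance.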


You will rightly complain that we have still not specified how the source trans- 
forms the space around it to have the required property. And since time does 
not appear in the laws quoted so far, the action even appears to be instan- 
taneous. Indeed classical physics is unable to answer these questions. Newton 
himself was well aware of this problem. In a letter to Bentley he wrote in
1692³:


It is inconceivable that inanimate Matter should, without the 
Mediation of something else, which is not material, operate upon, 
and affect other matter without mutual Contact... That Gravity 
should be innate, inherent and essential to Matter, so that one 
body may act upon another at a distance thro’ the Vacuum, with- 
out Mediation of any thing else, by and through which their Action 
and Force may be conveyed from one to another, is to me so great 
an Absurdity that I believe no Man who has in philosophical Mat- 
ters a competent Faculty of thinking can ever fall into it. Gravity 
must be caused by an Agent acting constantly according to certain 
laws; but whether this Agent be material or immaterial, I have left
to the Consideration of my readers. 


Coulomb’s law, named after Charles-Augustin de Coulomb, which 
describes the force between two static electric charges Q and q in the same 
spatial configuration as the two masses above, has a completely analogous 
form to Newton’s gravitational law: 

$$\vec{F} = \frac{1}{4\pi\epsilon_0}\,\frac{qQ}{r^2}\,\hat{r}$$


³ See: Stanford Encyclopedia of Philosophy,
https://plato.stanford.edu/entries/newton-philosophy/

It thus shares the same properties: it describes action at a distance, in a
symmetric way between source and probe. And even when reformulated using
the local electric field $\vec{E}$ created by the electric charge Q, i.e. $\vec{F}(\vec{r}) = q\vec{E}(\vec{r})$,
there is no information on what bridges the gap between the source Q and the 
probe q. Electromagnetic interactions take a leading role in what follows. The 
reason is that there is a complete classical field theory of electromagnetism, 
first formulated by James Clerk Maxwell in his “Treatise on Electricity and 
Magnetism” [38], published in 1873. The seminal importance of this work was 
immediately recognised by his contemporaries. A book review in Nature [39] 
in the same year celebrated the Treatise enthusiastically: 


We could easily point to whole treatises (some of them in many 
volumes) still accepted as standard works, in which there is not 
(throughout) a tithe of the originality and exhaustiveness to be 
found in any one of Maxwell’s chapters. 


The book discusses electrical and magnetic phenomena in a way that clearly 
separates the generation of the electric and magnetic fields from their action 
on electric charges. In his original publication, Maxwell presented the equa- 
tions in less compact form than what we present in Focus Box 2.3. However, 
it was quickly realised that they can be reformulated using vector calculus, 
as we do here. This is natural, since both electric and magnetic fields are 
characterised by intensity and direction. Using the differential operators of 
divergence and curl, Oliver Heaviside, a largely self-taught engineer, mathe- 
matician and physicist, arrived at the four partial differential equations we 
quote. They show how electric fields converge towards or diverge from their 
source, positive and negative electric charges (Gauss’ law). And how mag- 
netic fields curl around electric currents, charges in motion (Ampere’s law). 
See Figure 2.2 for a simple graphic representation of these ways to generate 
the two fields. 


Figure 2.2 Left: A positive electric charge causes the electric field to diverge 
from its position (Gauss’ law). A variable magnetic field causes an electric 
one that curls around it (Faraday’s law). Right: An electric current causes 
a magnetic field that curls around the current (Ampere’s law). A variable 
electric field also does that (Maxwell’s completion of Ampere’s law). 


A dramatic feature of Maxwell’s equations is that they also describe how 
electric and magnetic fields interact directly with each other. For that, they 
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Maxwell’s equations in vacuo read as follows: 


VE= ap (Gauss’ law) VB=0 
VxE= we. (Faraday’s Law) VxB= ko (i+ co) 


(Ampere/Maxwell law) 


Gauss’ law specifies how the electric field vector $\vec{E}(\vec{r})$ diverges from its source, 
a local charge density $\rho(\vec{r})$. The magnetic field $\vec{B}(\vec{r})$ has no such sources; the 
magnetic field has no divergence. Faraday’s law specifies how electric fields curl 
around a magnetic field, $\vec{B}(t)$, varying in time. Ampere’s law, corresponding 
to the first term in the last equation, identifies the current density $\vec{j}(\vec{r})$ as the 
prime source of magnetic fields. Currents being made of charges in motion, 
there is thus a single source of electromagnetic fields, electric charge. A second 
term was added by Maxwell to make the whole set consistent. It says that 
magnetic fields also curl around variable electric fields, $\vec{E}(t)$, and establishes 
symmetry between electric and magnetic fields as far as that is concerned. 
The constants permittivity $\epsilon_0$ and permeability $\mu_0$ are universal properties 
of empty space, fixed by measurement. Together they specify how fast 
electromagnetic phenomena propagate through space-time, i.e. the speed of light 
$c = 1/\sqrt{\mu_0 \epsilon_0}$. 
Since electromagnetic forces are conservative, we can express Maxwell’s laws 
in terms of the electric and magnetic potentials $V$ and $\vec{A}$, such that 
$\vec{E} = -\vec{\nabla} V - \partial \vec{A}/\partial t$ and $\vec{B} = \vec{\nabla} \times \vec{A}$. The potentials form a four-vector $A^\mu = (V/c, \vec{A})$. 
Four-vectors are explained in Focus Box 3.6. Using the four-vector of the 
electromagnetic current density $j^\mu = (c\rho, \vec{j})$, one can write Maxwell’s equations 
in their most compact form (valid in the Lorenz gauge, $\partial_\mu A^\mu = 0$, and in the SI units used here): $\partial_\nu \partial^\nu A^\mu = \mu_0 j^\mu$, where $\partial_\nu \partial^\nu = \frac{1}{c^2} \frac{\partial^2}{\partial t^2} - \vec{\nabla}^2$. 
They are invariant under gauge transformations: 

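$$\vec{A} \rightarrow \vec{A}' = \vec{A} + \vec{\nabla} \Lambda$$

$$V \rightarrow V' = V - \frac{\partial \Lambda}{\partial t}$$

with an arbitrary scalar function $\Lambda(x)$ of the space-time four-vector $x$. These 
transformations can be written in covariant form as $A^\mu \rightarrow A'^\mu = A^\mu - \partial^\mu \Lambda$. 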
They leave the electric and magnetic fields untouched. 


Focus Box 2.3: Maxwell’s equations 
do not need charges and currents. In the absence of both, variable magnetic 
fields cause curling electric fields to appear as sketched on the left of Fig- 
ure 2.3. This fact had been shown earlier by Michael Faraday in a series of 
brilliant experiments on induction. Maxwell added a term to Ampere’s law 
which removed a mathematical inconsistency in the set of equations known 
beforehand. His term claimed that also varying electric fields cause magnetic 
fields to curl around them, much as electric currents do (see Figure 2.3). The 
dramatic consequence of this addition is that electric and magnetic fields can 
now propagate in empty space without intervening charges and currents, as 
explained in Focus Box 2.4. If we set all charge and current densities to zero in 
Maxwell’s equations, we arrive at electric and magnetic fields with no diver- 
gence, but a curl proportional to the rate of change in the other field, i.e. 
magnetic and electric, respectively. The solution to this set of equations is an 
electromagnetic wave as sketched on the right of Figure 2.3. Maxwell identified 
visible light with this electromagnetic phenomenon. When you measure 
the electric field at a given place, it oscillates with frequency $\nu = c/\lambda$ along a 
fixed axis, orthogonal to the direction of the wave propagation. At the same 
place, the magnetic field also oscillates with the same frequency, in phase with 
the electric one, but along an orthogonal transverse axis. When you take a 
snapshot of the wave at a fixed time, you see what the figure sketches: maxima 
and minima for both fields occur at the same place, separated by a wavelength 
$\lambda$. The maxima and minima of the wave propagate at a fixed speed in vacuum, 
$c = 1/\sqrt{\mu_0 \epsilon_0}$; inside matter, this speed is reduced. 
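To put numbers on these relations, here is a minimal Python sketch that evaluates $c = 1/\sqrt{\mu_0 \epsilon_0}$ and $\nu = c/\lambda$. The function names, the 500 nm wavelength and the 100 V/m field amplitude are illustrative assumptions, not values from the text. The amplitude relation $E_0 = c B_0$ and the intensity $I = c \epsilon_0 E_0^2 / 2$ are those quoted in Figure 2.3 and Focus Box 2.4; the constants are CODATA 2018 values.

```python
import math

# Vacuum constants in SI units (CODATA 2018 values).
EPSILON_0 = 8.8541878128e-12  # permittivity, F/m
MU_0 = 1.25663706212e-6       # permeability, N/A^2


def speed_of_light():
    """Propagation speed in vacuum, c = 1 / sqrt(mu_0 * epsilon_0)."""
    return 1.0 / math.sqrt(MU_0 * EPSILON_0)


def plane_wave_properties(wavelength_m, e0_v_per_m):
    """Frequency, wave number, magnetic amplitude and intensity of a plane wave."""
    c = speed_of_light()
    frequency = c / wavelength_m  # nu = c / lambda
    return {
        "frequency_hz": frequency,
        "angular_frequency_rad_per_s": 2.0 * math.pi * frequency,
        "wave_number_rad_per_m": 2.0 * math.pi / wavelength_m,
        "b0_tesla": e0_v_per_m / c,  # from E0 = c * B0
        "intensity_w_per_m2": 0.5 * c * EPSILON_0 * e0_v_per_m**2,
    }


if __name__ == "__main__":
    # Hypothetical example: green light of 500 nm, field amplitude 100 V/m.
    print(f"c = {speed_of_light():.6e} m/s")
    for name, value in plane_wave_properties(500e-9, 100.0).items():
        print(f"{name} = {value:.4e}")
```

For visible light the frequency comes out at several hundred terahertz, which is why the oscillation of the fields cannot be followed directly and only time-averaged intensities are observed.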


[Figure 2.3 labels: increasing B; increasing E] 


Figure 2.3 Left: A magnetic field B varying in time causes an electric field to 
curl around it. An electric field E varying in time causes a magnetic field to 
curl around it. Right: This leads to an electromagnetic wave propagating in 
space and time with the speed of light and a wavelength $\lambda$. The oscillations 
of the electric and magnetic field are in phase, the electric and magnetic field 
vectors are orthogonal to each other and to the direction of propagation $\vec{v}$. The 
propagation velocity is equal to the speed of light, $|\vec{v}| = c$, and the amplitudes 
of the two fields are related by $E_0 = c B_0$. 


After Maxwell’s publication the Prussian Academy of Sciences offered a 
prize in 1879 to whoever could demonstrate the existence of electromagnetic 
waves. However, the prize expired uncollected in 1882. That electromagnetic 
waves indeed exist was shown by the physicist Heinrich Hertz, the son of a 
Hamburg senator, in a series of experiments collectively described in a paper 
from 1887 [50]. Just two years earlier he had been appointed professor at the 
Karlsruhe polytechnic institute, which offered excellent experimental facilities. 
Figure 2.4 shows examples of the emitters and the receivers he used. Both 
were based on a spark gap. When the emitter was excited to show a spark, 
the receiver also showed one, with an intensity diminishing with increasing 
distance. 


Figure 2.4 Foreground: An emitter oscillator used by Hertz for his transmission 
experiments. The two balls at the extremes regulate the capacitance and thus 
the resonance frequency of the open circuit. When the antenna is excited, the 
gap in the middle will spark and a short electromagnetic wave pulse will be 
emitted. Background: Receiver resonators used by Hertz, with different shapes 
and sizes. When the wave pulse arrives at the antenna, a spark will bridge the 
small gap. (Credit: Deutsches Museum, München, Archiv, BN02391, reprinted 
by permission) 


The existence of electromagnetic fields propagating by themselves in vac- 
uum has important implications for the interpretation of fields in general. 
Before Maxwell, one could have assumed that gravitational, electric and mag- 
netic fields were a mathematical device, simply splitting in two the unex- 
plained transmission of force from source to destination. Maxwell’s electro- 
magnetism and Hertz’ confirmation of the transmission of electromagnetic 
waves fundamentally changed the role of fields. Einstein [239] summarises 
this change of paradigm in 1931: 


... before Maxwell people conceived of physical reality —in as far 
as it is supposed to represent events in nature— as material points, 


“Reproduced with permission of the Licensor through PLSclear. 
whose changes consist exclusively of motions, which are subject to 
total differential equations. After Maxwell they conceived physi- 
cal reality as represented by continuous fields, not mechanically 
explicable, which are subject to partial differential equations. 


The reason for the shift from total to partial differential equations is simple. 
Mass points are exhaustively described by their position as a (vector-) function 
of time alone, $\vec{r}(t)$, thus of a single free variable. Changes are thus described by 
a total derivative with respect to time. Fields are (vector-) functions of time 
and space, thus their evolution is described by partial differential equations 
involving the space-like partial differential operations of divergence and curl 
as well as the partial time derivative. 

The action of electromagnetic fields on charges was understood soon after 
Maxwell’s theory by Hendrik Antoon Lorentz [60]. In contrast to Heaviside, 
Lorentz was a classical scholar who received excellent academic training in the 
Netherlands. We will come back to his many contributions to physics later. The 
so-called Lorentz force is detailed in Focus Box 2.5. The electric field exerts 
a force on electric charges irrespective of their state of motion, proportional 
to the charge and the local field strength and along the field direction. The 
magnetic field exerts a force only on charges in motion; its strength and direction 
are given by the charge multiplied by the cross product of the charge’s 
velocity with the field vector. This means that the strength is maximum if the 
magnetic field is orthogonal to the velocity, zero if the magnetic field vector 
points in the direction of the velocity. The magnetic force is also zero when 
the charge probing it is at rest. But at rest with respect to what? 
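The three cases just described (a charge at rest, a velocity orthogonal to the magnetic field, a velocity parallel to it) can be checked with a short Python sketch of the force law $\vec{F} = q(\vec{E} + \vec{v} \times \vec{B})$. The helper names and the field and velocity values are hypothetical and chosen only for illustration.

```python
import math


def cross(a, b):
    """Cross product of two 3-vectors given as tuples (x, y, z)."""
    return (
        a[1] * b[2] - a[2] * b[1],
        a[2] * b[0] - a[0] * b[2],
        a[0] * b[1] - a[1] * b[0],
    )


def lorentz_force(q, e_field, velocity, b_field):
    """Force F = q (E + v x B) on a point charge q, all quantities in SI units."""
    v_cross_b = cross(velocity, b_field)
    return tuple(q * (e + m) for e, m in zip(e_field, v_cross_b))


def norm(vec):
    """Euclidean length of a 3-vector."""
    return math.sqrt(sum(c * c for c in vec))


if __name__ == "__main__":
    q = -1.602176634e-19      # electron charge in C
    b = (0.0, 0.0, 1.0e-3)    # hypothetical magnetic field of 1 mT along z
    no_e = (0.0, 0.0, 0.0)

    # Velocity orthogonal to B: maximum magnetic force |q| v B.
    print(norm(lorentz_force(q, no_e, (1.0e6, 0.0, 0.0), b)))
    # Velocity parallel to B: the magnetic force vanishes.
    print(norm(lorentz_force(q, no_e, (0.0, 0.0, 1.0e6), b)))
    # Charge at rest: only the electric field acts.
    print(lorentz_force(q, (100.0, 0.0, 0.0), (0.0, 0.0, 0.0), b))
```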

It may seem natural even today to assume that there is a universal reference 
system for space and time, a fixed stage on which things happen. The celestial 
and terrestrial motion due to gravity, the generation and action of electromag- 
netic fields could then all be described with respect to that absolute space and 
time. In the simplest case, this universal coordinate system would be somehow 
fixed to a substance that would be detectable, the ether. There were many vari- 
ants of ether theory. In general an ideal incompressible liquid was assumed, more 
or less frictionless with respect to the motion of matter, fixed to absolute space. 
Ether could be dragged along by the motion of matter or not, depending on the 
author’s ideas. Ether would also be the ideal medium to transport electric and 
magnetic fields (and maybe also gravitational ones). Fields would be transmitted 
in vague analogy to sound in gases or liquids. However, this analogy reaches its 
limits rather early on: sound is a density wave, a scalar quantity which varies 
longitudinally with respect to the wave’s evolution; electric and magnetic fields 
are vectors transverse to the direction of motion of an electromagnetic wave. It 
thus takes a more elaborate mechanical quantity, like shear stress, to transmit 
such phenomena. But theories of this kind were nonetheless accepted by promi- 
nent physicists, including Maxwell [38]: 


From these considerations Professor W. Thomson [Lord Kelvin] 
has argued, that the medium [ether] must have a density capable 
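In the absence of electric charges and currents, Maxwell’s equations take on 
a particularly simple and symmetric form: 

$$\vec{\nabla} \cdot \vec{E} = 0 \; ; \qquad \vec{\nabla} \cdot \vec{B} = 0$$

$$\vec{\nabla} \times \vec{E} = -\frac{\partial \vec{B}}{\partial t} \; ; \qquad \vec{\nabla} \times \vec{B} = \mu_0 \epsilon_0 \frac{\partial \vec{E}}{\partial t}$$

We take the curl of the second pair of equations: 

$$\vec{\nabla} \times (\vec{\nabla} \times \vec{E}) = -\frac{\partial}{\partial t} (\vec{\nabla} \times \vec{B}) = -\mu_0 \epsilon_0 \frac{\partial^2 \vec{E}}{\partial t^2}$$

$$\vec{\nabla} \times (\vec{\nabla} \times \vec{B}) = \mu_0 \epsilon_0 \frac{\partial}{\partial t} (\vec{\nabla} \times \vec{E}) = -\mu_0 \epsilon_0 \frac{\partial^2 \vec{B}}{\partial t^2}$$

Using the vector identity $\vec{\nabla} \times (\vec{\nabla} \times \vec{V}) = \vec{\nabla} (\vec{\nabla} \cdot \vec{V}) - \vec{\nabla}^2 \vec{V}$ and the fact that 
neither $\vec{E}$ nor $\vec{B}$ has a divergence, we find the wave equations: 

$$\frac{1}{c^2} \frac{\partial^2 \vec{E}}{\partial t^2} - \vec{\nabla}^2 \vec{E} = 0 \; ; \qquad \frac{1}{c^2} \frac{\partial^2 \vec{B}}{\partial t^2} - \vec{\nabla}^2 \vec{B} = 0$$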


with the speed of light $c = 1/\sqrt{\mu_0 \epsilon_0}$ in vacuum. Using the four-vector potential 
$A^\mu = (V/c, \vec{A})$ with the so-called Lorenz gauge condition, $\partial_\mu A^\mu = 0$, the same 
equations become $\partial_\nu \partial^\nu A^\mu = 0$. The solutions to the wave equations describe 
a monochromatic plane wave: 

$$\vec{E} = \vec{E}_0 \, e^{i(\vec{k} \cdot \vec{r} - \omega t)} \; ; \qquad \vec{B} = \vec{B}_0 \, e^{i(\vec{k} \cdot \vec{r} - \omega t)}$$

with $\vec{E}_0 \perp \vec{B}_0 \perp \vec{k}$ and $E_0 = c B_0$, depicted in Figure 2.3. Its angular frequency 
is $\omega = 2\pi\nu$; its direction and wavelength are characterised by the wave 
vector $\vec{k}$ with $k = 2\pi/\lambda$. Equal phases of the wave are located in planes 
orthogonal to the direction of propagation, $\vec{k}/k$, thus the name plane wave. 
The observable values of the magnetic and electric field are of course given by 
the real parts of these complex functions. The time-averaged intensity of the 
wave is $I = c \epsilon_0 E_0^2 / 2$. In terms of the potentials, the plane wave solutions are: 

$$A^\mu = A_0^\mu(k) \, e^{-i k_\nu x^\nu}$$

with $k^\mu = (\omega/c, \vec{k})$ and $k_\mu k^\mu = 0$ so that the wave moves at the speed of light. 
By the Lorenz condition, the four-vector of the potential is orthogonal to the 
momentum four-vector, $k_\mu A_0^\mu = 0$, and it has only three degrees of freedom. 


Focus Box 2.4: Electromagnetic waves 
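As a numerical cross-check of the wave equation in this box, the following Python sketch evaluates $(1/c^2)\,\partial^2 E/\partial t^2 - \partial^2 E/\partial x^2$ by finite differences for the real part of a plane wave travelling along $x$. It is a sketch under stated assumptions: one spatial dimension, a hypothetical wavelength of 1 m, and step sizes chosen for illustration. A wave whose phase speed is $c$ gives a residual compatible with zero, while a trial wave moving at $c/2$ does not solve the equation.

```python
import math

C = 299_792_458.0  # speed of light in vacuum, m/s


def field(x, t, wavelength, speed):
    """Real part of a unit-amplitude plane wave moving along x at the given phase speed."""
    k = 2.0 * math.pi / wavelength
    omega = speed * k
    return math.cos(k * x - omega * t)


def wave_equation_residual(x, t, wavelength, speed):
    """Finite-difference estimate of (1/c^2) d2E/dt2 - d2E/dx2 at the point (x, t)."""
    hx = 1.0e-3 * wavelength  # spatial step
    ht = hx / C               # time step
    f0 = field(x, t, wavelength, speed)
    d2_dt2 = (field(x, t + ht, wavelength, speed) - 2.0 * f0
              + field(x, t - ht, wavelength, speed)) / ht**2
    d2_dx2 = (field(x + hx, t, wavelength, speed) - 2.0 * f0
              + field(x - hx, t, wavelength, speed)) / hx**2
    return d2_dt2 / C**2 - d2_dx2


if __name__ == "__main__":
    x, t, wavelength = 0.1, 1.0e-9, 1.0  # hypothetical sample point and wavelength
    print("phase speed c:  ", wave_equation_residual(x, t, wavelength, C))
    print("phase speed c/2:", wave_equation_residual(x, t, wavelength, 0.5 * C))
```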
Lorentz’ law describes how the local electric ($\vec{E}$) and magnetic fields ($\vec{B}$) act 
on a mass point with electric charge $q$ moving at a velocity $\vec{v}$: 

$$\vec{F} = q \left( \vec{E} + \vec{v} \times \vec{B} \right)$$


The electric field acts on charges at rest and in motion, in or against the 
direction of the electric field vector, depending on the sign of the charge. The 
magnetic force only acts on moving charges; its direction is orthogonal both 
to the velocity and to the magnetic field direction. Since the state of motion 
depends on the frame of reference in which it is described, one might suspect 
trouble with the distinction between these two fields. Indeed, the split into 
electric and magnetic forces is not invariant under Lorentz transformations 
(see Chapter 3). 


Focus Box 2.5: Lorentz’ law of the classical electromagnetic force 


of comparison with that of gross matter, and has even assigned an 
inferior limit to that density. We may therefore receive, as a datum 
derived from a branch of science independent of that with which we 
have to deal, the existence of a pervading medium, of small but real 
density, capable of being set in motion, and of transmitting motion 
from one part to another with great, but not infinite, velocity. 


However, at least Michael Faraday had doubts already in the 1850s. In an 
unpublished note [388] preserved at the Royal Society he wrote: 


The ether -its requirements— should not mathematics prove or 
shew that a fluid might exist in which lateral vibrations are more 
facil than direct vibrations. Can that be the case in a homoge- 
neous fluid? Yet must not the ether be homogeneous to transmit 
rays in every direction at all times. If a stretched spring represent 
by its lateral vibrations the ether and its vibrations -what is there 
in the ether that represents the strong cohesion in the line of the 
string particles on which however the lateral vibration(s) essen- 
tially depend. And if one tries to refer it to a sort of polarity how 
can that consist with the transmission of rays in every direction 
at once across a given ether. 


If the classical theorem on the addition of velocities holds, i.e. if they simply 
add up vectorially, an experiment on Earth cannot always be at rest with 
respect to the ether. Thus if the speed of light is measured in two different 
directions, a difference ought to be observed if light propagates in this medium. 
Several experiments, most prominently by Fizeau, Michelson and Morley, tried 
to establish this dependence on moving sources or moving physical media. 
They had no success. The speed of light is constant under all circumstances, as 
Maxwell’s laws predict. We will come back to the significance of these findings 
in Chapter 3. 
2.2 RAYS 


Particle physics started with the discovery of rays: cathode rays, X-rays and 
uranium rays. Cathode rays came first, because the technology to produce 
them was already in place in the form of discharge tubes. These consist of 
glass tubes which contain a rarefied gas, equipped with electrodes. When high 
voltage is applied to the electrodes, a discharge is triggered and colourful light 
is produced by the interaction between fast electrons and the rest gas, creating 
a plasma. These tubes are called Geissler tubes after the German glassblower Heinrich Geissler, 
who beginning in 1857 constructed artistic cold cathode tubes with different 
gases in them. Geissler tubes were mostly novelty items, made to demonstrate 
the new science of electricity. The technology was commercialised by French 
engineer Georges Claude in 1910 and evolved into neon lighting. 

In the middle of the 19th century, the nature of cathode rays was not known 
and attracted serious research interest. Pioneers were the Swiss Auguste de 
la Rive [40] and the German Julius Plücker. The former made the connection 
between the luminous effects of cathode rays and the aurora borealis [21]. The 
latter and his student Johann Wilhelm Hittorf worked with Geissler to develop 
more efficient vacuum pumps, since lowering the pressure at first enhances the 
luminous effect [32]. However, when a high enough vacuum was achieved, the 
light disappeared from most of the tube; instead, a greenish glow appeared on 
the glass tube near the anode. Something was coming out of the cathode and hitting 
the glass before being collected by the anode. The phenomenon was named 
cathode rays (“Cathodenstrahlen”) by Eugen Goldstein [41]. Plücker also 
discovered that cathode rays could be deflected by magnetic fields. In parallel, 
similar phenomena were observed by the British physicist, chemist and spir- 
itualist Sir William Crookes (see [85]). We will see many more examples of 
concurrent studies in Britain and on the continent during this period. 

Hertz, on the other hand, failed to deflect cathode rays by charged plates 
and wrongly concluded that they were more like electromagnetic radiation. 
He also observed that they could penetrate thin metal foils. Hertz’ assistant 
Philipp Lenard used this to construct a discharge tube with a thin metal 
window allowing the cathode rays to penetrate into open air, and studied 
their absorption by various materials. He was awarded the 1905 Nobel prize 
for physics for his work on cathode rays [123]. A few years later he also dis- 
covered that cathode rays were emitted when the cathode is illuminated by 
ultraviolet light. This is now called the photoelectric effect and played an 
important role in the development of quantum physics (see Chapter 5). Lenard 
was a nationalist and later became a fervent supporter of Nazi ideology (see 
Section 6.2). 

Following Lenard’s findings, Wilhelm Conrad Röntgen in Würzburg 
became interested in cathode rays in 1895. Like many others, he used 
as detector a screen covered with barium platinocyanide, Ba[Pt(CN)$_4$], a 
substance which fluoresces⁵ when hit by cathode rays. When he wrapped his 
cathode ray tube in heavy paper in a dark room, he accidentally discovered 
that the screen started to fluoresce even when cathode rays could not possibly 
have hit it. He introduced the first presentation of his findings to the Physical 
and Medical Society in Würzburg on December 28, 1895 [62] with the words: 


If the discharge of a larger Ruhmkorff [inductor] is passed through 
a Hittorf vacuum tube, or a sufficiently evacuated Lenard-, Crookes- 
or similar apparatus, and the tube is covered with rather tightly 
fitting thin black cardboard, one will see, in a completely darkened 
room, a paper screen painted with barium platinocyanide light up 
brightly at every discharge, fluoresce regardless if the painted or 
the opposite side of the screen is directed towards the discharge 
tube. The fluorescence is still visible at 2m distance from the appa- 
ratus. 


The new radiation was highly penetrating and attenuated only by dense 
material. It blackened photographic plates, was refracted by crystals, but not 
noticeably reflected. Röntgen concluded that the new rays presented a “kind 
of kinship to light.” He named the new rays “X-rays”. But already in the dis- 
cussion after his presentation [62], which was received with roaring applause 
by the audience, a colleague proposed to call them “Röntgen’sche Strahlen”, 
a name that has stuck in German, Russian and other northern European lan- 
guages. The importance of his discovery for medical imaging was immediately 
realised and his second communication [71] included an X-ray photo of the 
hand of the Society’s co-founder, an anatomist. 

Uranium rays were discovered by Henri Becquerel in 1896, again using 
photographic plates as a detector. Apparently he realised immediately that 
he was onto something important, because he notified the weekly meeting of 
the Académie des Sciences in Paris with a short note, barely one page [68]. 
The description of his experiment reads: 


One wraps a gelatino-bromide photographic Lumière plate⁶ with 
two sheets of very thick black paper, such that the plate does not 
grow hazy after an exposure to the Sun during one day. One puts 
on the outside of the paper a sheet of the phosphorescent⁷ substance 
[in his case double potassium-uranium sulfate, K(UO)SO$_4$] 
and one exposes the whole to the Sun, during several hours. When 
one then develops the photographic plate, one recognises that the 
silhouette of the phosphorescent substance appears in black on the 


5Fluorescence is the emission of visible or near-UV light by molecules which have been 
excited by X-rays or ionising radiation. 

6A dry photographic plate commercialised by the Lumière brothers between 1890 and 
1950. 

7Phosphorescence is similar to fluorescence, but with a delay between irradiation and 
emission of light. The emitted glow slowly fades away, with a time scale ranging from 
minutes to hours. 
picture. If one places a coin between the phosphorescent substance 
and the paper, ...one sees the image of this object appear on the 
picture. 


In a series of communications to the Académie [64, 65, 66, 67, 69, 70] he elab- 
orated on his experiments. He found that the invisible radiation was able to 
discharge charged plates and penetrate various opaque substances. Also ura- 
nium salts which did not phosphoresce emitted the radiation, so unlike phos- 
phorescence it was independent of an outside energy source. Using metallic 
uranium he confirmed that it was this component of the salts which emitted 
the radiation. 

Marie and Pierre Curie, together with Gustave Bémont [81, 82, 83], discovered 
in 1898 other elements (like thorium and radium) contained in uraninite 
(called pitchblende at the time) and chalcolite, which emitted the “uranium” 
radiation much more intensely than uranium itself. Aware of the importance 
of publishing new findings quickly in this very productive period, Marie Curie 
presented a short communication to the Académie. In it, she coined the 
term “radioactive” to characterise elements emitting the new radiation. She 
was unaware that a few months earlier, Gerhard Carl Schmidt had published 
similar findings in Berlin [80]. Becquerel and the Curies shared the 1903 Nobel 
prize for their discoveries of spontaneous radioactivity. 

It soon became clear that radioactive substances emitted more than one 
kind of rays. Ernest Rutherford classified the charged ones according to their 
penetration capabilities [86]. He named the components alpha- and beta-rays 
in order of increasing penetration power. The former were stopped by paper. 
The beta-rays penetrated paper, but not metal, and turned out to be the ones 
first detected by Becquerel. 

Villard [95, 96] and Becquerel [93] identified in 1900 a third component 
of radioactivity, which Rutherford [108] named gamma-rays. He and Edward 
Andrade [154] measured their spectrum in 1914 and found that they consisted 
of light with a much shorter wavelength than X-rays. 


2.3 ELECTRONS 


The nature of the newly discovered charged rays remained unclear and became 
a prominent subject of speculation and research. A sizeable fraction of 19th-century 
physicists imagined phenomena like the electric charge and current 
as consisting of charged particles. Prominent among them, Wilhelm Weber 
called the electric current a “liquid of electric molecules” [56]. On the basis 
of Michael Faraday’s law of electrolysis, Hermann von Helmholtz argued that 
there existed “atoms of electricity”, although he did not go so far as to iden- 
tify these units of electric charge with a particle. Moreover, in 1895 as a 
young graduate student, Jean Baptiste Perrin showed that when collected in 


8The history of Faraday’s electrochemical work and its interpretation is covered in [435]. 
a Faraday cup, cathode rays would charge up the cup negatively [61]. He also 
succeeded in deflecting them magnetically. 

On the other hand, eminent physicists like Goldstein, Hertz, and Lenard 
thought that cathode rays were more light-like, i.e. associated with phenomena 
of the ether. This view seemed especially supported when Hertz, in one of 
the more famous wrong experiments, failed to deflect them electrically [47], 
because of the dielectric properties of the glass or conduction by the rest gas in 
his tube. It took progress in obtaining high vacuum by William Crookes [443] 
and others to prove him wrong. 

There were thus two fundamentally different interpretations of cathode ray 
phenomena, which motivated Joseph John Thomson⁹ at the Cavendish Laboratory 
to make further and careful investigations in 1896-97. He announced 
his findings to the Royal Institution [74] on April 30, 1897. In the introduction 
to his more elaborate account in the Philosophical Magazine [75] he wrote: 


The experiments discussed in this paper were undertaken in the 
hope of gaining some information as to the nature of the Cathode 
Rays. The most diverse opinions are held as to these rays; accord- 
ing to the almost unanimous opinion of German physicists they 
are due to some process in the ether to which -inasmuch as in a 
uniform magnetic field their course is circular and not rectilinear— 
no phenomenon hitherto observed is analogous: another view of 
these rays is that, so far from being wholly etherial, they are 
in fact wholly material, and that they mark the paths of parti- 
cles of matter charged with negative electricity. It would seem at 
first sight that it ought not to be difficult to discriminate between 
views so different, yet experience shows that this is not the case, as 
amongst the physicists who have most deeply studied the subject 
can be found supporters of either theory. 


The electrified-particle theory has for purposes of research a great 
advantage over the ætherial theory, since it is definite and its consequences 
can be predicted; with the ætherial theory it is impossible 
to predict what will happen under any given circumstances, as on 
this theory we are dealing with hitherto unobserved phenomena in 
the ether, of whose laws we are ignorant. 


In a series of experiments in the early 1890s, Thomson had already gathered 
information about the nature of cathode rays. He had e.g. determined that 
they were deflectable by a magnetic field not only close to the cathode but all 
along their trajectory. He had roughly measured their speed by a stroboscope. 
He found about $1.9 \times 10^7$ cm/s, large compared to the typical speed of ions 
in a discharge tube, but small compared to the speed of light [59]. In the 
1897 series of experiments [74, 75], he was able to more accurately measure 
both the speed and the charge-to-mass-ratio e/m of the rays using a clever 


9For a detailed history of this discovery, see [462]. 
combination of electric and magnetic deflection shown in Figure 2.5. Today we 
would call such a set-up an electromagnetic spectrometer for charged particles. 
The rays emitted by discharge between cathode and anode on the extreme left 
of the tube pass through the cylindrical anode and enter into the elongated 
straight section of the tube, with constant velocity. They enter into a deviation 
region where both a vertical electric field and a horizontal magnetic field act 
on them. Adjusting the voltage applied to the parallel plate capacitor, the 
electrical deviation can be varied. Regulating the current in the Helmholtz- 
type magnet coils, the magnetic deviation can be changed. Adjusting both 
deviations to be the same, and knowing the necessary field strengths, the 
velocity of the rays and the ratio of charge to mass can be simultaneously 
measured. How this is done is detailed in Focus Box 2.6. The results Thomson 
obtained for the speed, using various rest gases and three tubes, confirmed 
his earlier conclusion. The mass-to-charge results were all in the region of 
$-0.5 \times 10^{-11}$ kg/C. Calculated by modern standards his result can be quoted 
as $m/e = -(0.51 \pm 0.16) \times 10^{-11}$ kg/C (or $e/m = -(1.96 \pm 0.31) \times 10^{11}$ C/kg). 
As the number of electrons is very large, the error is entirely systematic in 
nature. 


[Figure 2.5 labels: Cathode (-); Anode (+); Cathode ray; Charged plates; Magnets on outside of glass] 

Figure 2.5 Schematic of a cathode ray tube equipped with deflector 
electrodes and magnets [670]. (Credit: openstax.org, access for free at 
https://openstax.org/books/chemistry-2e/pages/1-introduction) 


The speed of the rays can also be determined by using the fact that the 
potential difference U between the first anode and cathode of the discharge 
tube confers a fixed kinetic energy $E_\mathrm{kin} = E_\mathrm{pot} = qU$ to them, independent 
of cathode material or rest gas. If on the other hand the rays consisted of 
cathode or rest gas ions, the speed would depend on these materials. This 
was tested by Walter Kaufmann [72, 73], at the same time as Thomson’s 
experiments. Kaufmann found a more accurate value of $e/m = -1.77 \times 10^{11}$ C/kg, 
close to modern values (see Focus Box 2.6) and with smaller errors. Similar 
results were obtained by Emil Wiechert in Königsberg [76]. Kaufmann, how- 
ever, shied away from firmly concluding that cathode rays were light charged 
particles emitted by the cathode, the same for all cathode materials. He merely 
called this conclusion “satisfactory”. Steven Weinberg suspects that this timid- 
ity is due to the strong influence of the Viennese physicist and philosopher 
of science Ernst Mach [548], which was widespread at the time among German 
and Austrian physicists. Mach’s philosophy was that hypothetical entities 
like particles had no role in physics and only directly observable entities did. 
We will find similar attitudes when we discuss the interpretation of quantum 
mechanics in Chapter 5. 
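The energy argument at the start of this paragraph can be made concrete with a short Python sketch. From $qU = m v^2/2$ the speed is $v = \sqrt{2 (q/m) U}$, so it depends only on the charge-to-mass ratio of whatever carries the current. The 1000 V accelerating voltage is a hypothetical value; the charge-to-mass ratio of the hydrogen ion is a CODATA value added here for comparison and does not appear in the text. The formula is non-relativistic, which is adequate at these speeds.

```python
import math

E_OVER_M_ELECTRON = 1.75882001076e11  # C/kg, magnitude of the CODATA value
E_OVER_M_PROTON = 9.5788331560e7      # C/kg, hydrogen ion, for comparison only


def speed_from_voltage(voltage, charge_to_mass):
    """Non-relativistic speed from q U = m v^2 / 2, i.e. v = sqrt(2 (q/m) U)."""
    return math.sqrt(2.0 * charge_to_mass * voltage)


if __name__ == "__main__":
    voltage = 1000.0  # hypothetical potential difference in volts
    v_electron = speed_from_voltage(voltage, E_OVER_M_ELECTRON)
    v_ion = speed_from_voltage(voltage, E_OVER_M_PROTON)
    print(f"electron:     {v_electron:.3e} m/s")
    print(f"hydrogen ion: {v_ion:.3e} m/s")
    print(f"ratio:        {v_electron / v_ion:.1f}")
```

If cathode rays were ions of the cathode material or the rest gas, the speed at fixed voltage would change with the material. A speed that stays the same points to a single light carrier.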

That these results can be obtained in a consistent way does in fact show 
that cathode rays consist of the “electrified particles” mentioned in Thom- 
son’s introduction, emitted by any cathode material. Thomson concluded 
boldly [75]: 


...we have in the cathode rays matter in a new state, a state in 
which the subdivision of matter is carried much further than in the 
ordinary gaseous state: a state in which all matter -that is, matter 
derived from different source such as hydrogen, oxygen, etc.— is 
of one and the same kind; this matter being the substance from 
which the chemical elements are built up. 


He thus correctly speculated that the new particles were a component of all 
matter, yet existed also outside matter [99]. Thomson called them “corpus- 
cles”. We call them electrons, following George Johnstone Stoney [58] who 
introduced the term to denote an “atom of electricity”. 

This discovery can be taken as the starting point for two tremendously 
important lines of development: particle physics as a fundamental research 
subject on one hand; electronics as a technology pervading our daily life on 
the other hand. 

In 1900, Henri Becquerel measured the mass-to-charge ratio of beta 
rays [92] from radioactive samples using a spectrometer analogous to Thom- 
son’s. He found that it was the same as the m/e of cathode rays. Thus beta 
particles were also identified as electrons. Kaufmann, like essentially all other 
physicists, then changed sides [102]; the electron was accepted as a real particle. 


2.4 QUANTUM CHARGE 


Let us jump ahead by a few years to see how the measurement of the elec- 
tron e/m was converted to separate values for the two quantities, by a direct 
measurement of the elementary electric charge e. Important as the electron 
charge and mass are all by themselves, the impact of this measurement goes 
further. As Weinberg points out [548, chapter 3], the measurement of the 
electron charge opens the door to the atomic scale. Ratios of the molecular masses 
were known for many chemical elements by the work of John Dalton and the 
followers of his molecular theory (see Chapter 4). Likewise, ratios of charge 
and mass were known for many ions, mostly through electrolytic determina- 
tion. Considering crystals and metals to be densely packed arrangements of 
atoms, the ratio of their mass to their volume could also be estimated. The 
single isolated measurement of e gave access to a host of atomic and molecular 
characteristics.

The cathode ray particles enter the force region with a horizontal velocity $v$.
With a constant force $F$ orthogonal to $v$ applied in this region during a time
$t = l/v$, they exit with a perpendicular velocity component:

$$v_\perp = \frac{F}{m}\,t = \frac{F}{m}\,\frac{l}{v}$$

Since $v_\perp \ll v$, they take a time $T \simeq L/v$ to traverse the field-free drift
region. The displacement $d$ of the cathode ray beam (largely exaggerated in
the sketch) is thus:

$$d = T\,v_\perp = \frac{F}{m}\,\frac{L\,l}{v^2}$$

Thomson had a parallel plate capacitor inside the evacuated glass tube, with
a roughly constant electric field $E$ pointing vertically upward, exerting a
constant force $eE$ pointing downward. Solenoidal magnets outside the tube
created a roughly constant magnetic field $B$ in the horizontal direction pointing
towards you. The magnetic force was thus $evB$. Both resulted in displacements:

$$d_{el} = \frac{e}{m}\,\frac{E\,L\,l}{v^2} \quad ; \quad d_{mag} = \frac{e}{m}\,\frac{B\,L\,l}{v}$$

Measuring the two displacements, we can solve for the unknowns $v$ and $e/m$:

$$v = \frac{E}{B}\,\frac{d_{mag}}{d_{el}} \quad ; \quad \frac{e}{m} = \frac{E}{B^2\,L\,l}\,\frac{d_{mag}^2}{d_{el}}$$

With Thomson’s set-up and the electric and magnetic fields he used, the
displacement was a few centimetres. He obtained roughly correct values of
$e/m$ around $-2 \times 10^{11}$ C/kg. Today’s value recommended by the CODATA
group is $e/m = -1.75882001076(53) \times 10^{11}$ C/kg (see www.codata.org). A
gram of electrons thus has a negative charge of almost 200 million Coulomb.
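
The inversion of the two displacement relations can be checked numerically. The following is a minimal sketch, assuming the relations $d_{el} = (e/m)\,E L l/v^2$ and $d_{mag} = (e/m)\,B L l/v$; the field strengths, lengths and beam velocity are illustrative assumptions chosen to give displacements of a few centimetres, not Thomson's actual settings, and only magnitudes are used.

```python
# Round trip through the displacement relations of Focus Box 2.6 (magnitudes only).
# All numerical values below are illustrative assumptions, not Thomson's settings.
E_FIELD = 1.5e4   # electric field E in V/m (assumed)
B_FIELD = 5.0e-4  # magnetic field B in T (assumed)
L_FORCE = 0.05    # length l of the force region in m (assumed)
L_DRIFT = 0.30    # length L of the field-free drift region in m (assumed)


def displacements(e_over_m, v):
    """Return (d_el, d_mag) for a given |e/m| and horizontal beam velocity v."""
    d_el = e_over_m * E_FIELD * L_DRIFT * L_FORCE / v**2
    d_mag = e_over_m * B_FIELD * L_DRIFT * L_FORCE / v
    return d_el, d_mag


def solve(d_el, d_mag):
    """Invert the two displacement relations for v and |e/m|."""
    v = (E_FIELD / B_FIELD) * (d_mag / d_el)
    e_over_m = E_FIELD * d_mag**2 / (B_FIELD**2 * L_DRIFT * L_FORCE * d_el)
    return v, e_over_m


TRUE_E_OVER_M = 1.76e11  # |e/m| in C/kg, rounded
TRUE_V = 3.0e7           # beam velocity in m/s (assumed)

d_el, d_mag = displacements(TRUE_E_OVER_M, TRUE_V)
v_rec, e_over_m_rec = solve(d_el, d_mag)
print(f"d_el = {d_el * 100:.1f} cm, d_mag = {d_mag * 100:.1f} cm")
print(f"recovered v = {v_rec:.3e} m/s, |e/m| = {e_over_m_rec:.3e} C/kg")
```

The sketch only verifies that the two measured displacements suffice to recover both unknowns; it does not model the approximation $v_\perp \ll v$ or any experimental uncertainty.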


Focus Box 2.6: Thomson’s measurement of the electron e/m

It is thus not astonishing that eminent researchers involved themselves
in trying to measure the charge separately from mass. In the early years 
of the 20th century, Charles Thomson Rees Wilson invented the expansion 
cloud chamber, in which a saturated water or alcohol vapour is condensed by 
adiabatic expansion (see Chapter 8 for more details on this technology and its 
use). It was noted that the droplets of the resulting “rain” could be charged 
by picking up gas ions produced by ultraviolet light or X-rays (see Chapter 5 
for this mechanism, the photoelectric effect). Creating the vapour inside the 
electric field of a parallel plate capacitor, charged droplets are attracted by 
the plate of the opposite charge. Because of the frictional force exerted by 
the gas in which the vapour is suspended, droplets will drift with a constant 
velocity after a very short period of acceleration. Their drift velocity is then 
simply proportional to the total exerted force. Measuring the drift velocity 
under gravity alone and under the sum of gravity and electric field, one can 
deduce the electric charge. For a whole cloud, this will then correspond to the 
average charge of the drops inside. J.J. Thomson [84, 91, 110] and Harold A. 
Wilson [113] were pioneers of this technique. 

The Chicago physicist Robert A. Millikan had learned about this methodology
during a visit to the Cavendish Laboratory. Upon his return he started his own
investigation by narrowing down the velocity measurement to single drops in a similar
experimental set-up. He used water droplets condensed in an expansion cloud 
chamber moving with and without electric field [134], and ions produced by 
X-rays from radium exposure. He confirmed that all droplet charges were mul- 
tiples of an elementary charge e with good accuracy. However, he also realised 
that there were important systematic limitations to the accuracy of this mea- 
surement, most prominently by air turbulence and convection following the 
expansion and mass change of the droplets by evaporation during the velocity 
measurement. 

He thus improved the method [139], using mineral oil as a less volatile 
substance, charged by friction in an atomiser, as shown in Figure 2.6. His set- 
up let single drops fall into the capacitor through a hole, thus separating the 
turbulent inlet volume from the calm air between the plates. Ultraviolet light or
X-rays ionised the air between the plates, giving the droplets a chance to pick up
further charged ions¹⁰. He proudly states at the end of his paper that “so far
as I am aware, there is no determination of e ... by any other method which 
does not involve an uncertainty at least 15 times as great as that represented 
in the above measurements.” Of course he also realised the importance of his 
precision measurement in determining other fundamental constants, which he 
evaluated in a separate section. A comparison with modern determinations of 
e makes little sense, since in today’s SI system e serves as the definition of the 


10 Electrons freed by ionisation are very mobile and unlikely to be picked up by the
droplets.

The balance of forces in Millikan’s oil drop experiment is shown in the adjacent
sketch. On the left, the drop is falling freely inside the shorted capacitor. After
a short acceleration by gravity, an equilibrium with the frictional force is reached.
The downward gravitational force is diminished by buoyancy, such that
$F_g = (4\pi/3)\,R^3\,(\rho_{oil} - \rho_{air})\,g$, with the drop radius $R$ and the mass densities $\rho$.
The gravitational acceleration, $g = G M_\oplus / r_\oplus^2 \simeq 9.81\,\mathrm{m/s^2}$, is given by the
mass and radius of the Earth and the gravitational constant $G$. Gravity is
compensated by the upward frictional force $F_f$. For a perfectly laminar flow
around a spherical drop, Stokes’ law gives $F_f = 6\pi\,\mu_{air}\,R\,v$, proportional to its
velocity $v$ and pointing in the opposite direction. The viscosity of the medium
is $\mu_{air}$. In equilibrium, the two forces are equal and opposite. Solving for $v$ we
get the drift velocity of the falling drop:

$$v_1 = \frac{2}{9}\,\frac{\rho_{oil} - \rho_{air}}{\mu_{air}}\,R^2\,g$$

When the capacitor is charged such that there is an upward force on the drop,
the equilibrium sketched on the right is reached. The upward electrical force
on a singly charged drop, $F_e = eE$, is balanced by the sum of the gravitational
force and the frictional force, which now points downward. The equilibrium
is reached for $F_e = F_g + F_f$ and corresponds to the rising drift velocity $v_2$.
Inserting the known forces and solving for $e$ we get the relation used (and
corrected) by Millikan [139]:

$$e = \frac{4\pi}{3}\left(\frac{9\,\mu_{air}}{2}\right)^{3/2}\left(\frac{1}{g\,(\rho_{oil} - \rho_{air})}\right)^{1/2}\frac{(v_1 + v_2)\,v_1^{1/2}}{E}$$


In his systematic approach to the problem, Millikan found a deviation from 
Stokes’ law used above. It occurs for drop radii which are smaller than the 
mean free path of molecules in the gas. He took this linear deviation into 
account in his final results [152]. 
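
The two equilibrium conditions can be illustrated with a short numerical round trip. This is a sketch under stated assumptions: the oil and air densities, the air viscosity, the drop radius, the field strength and the number of charges on the drop are illustrative values, not Millikan's data, and the correction to Stokes' law is deliberately left out. The charge formula is the uncorrected relation $e = (4\pi/3)(9\mu_{air}/2)^{3/2}\,[g(\rho_{oil}-\rho_{air})]^{-1/2}\,(v_1+v_2)\,v_1^{1/2}/E$, applied to a drop carrying a general charge $q$.

```python
import math

G_ACC = 9.81                # gravitational acceleration in m/s^2
RHO_OIL = 900.0             # oil density in kg/m^3 (assumed)
RHO_AIR = 1.2               # air density in kg/m^3 (assumed)
MU_AIR = 1.8e-5             # air viscosity in Pa s (assumed)
E_CHARGE = 1.602176634e-19  # elementary charge in C (fixed in the SI)


def fall_velocity(radius):
    """Terminal velocity v1 of a drop falling in the shorted capacitor."""
    return 2.0 / 9.0 * (RHO_OIL - RHO_AIR) * G_ACC * radius**2 / MU_AIR


def rise_velocity(radius, charge, e_field):
    """Terminal velocity v2 of a rising drop, from q E = F_g + F_f."""
    f_grav = 4.0 * math.pi / 3.0 * radius**3 * (RHO_OIL - RHO_AIR) * G_ACC
    return (charge * e_field - f_grav) / (6.0 * math.pi * MU_AIR * radius)


def charge_from_velocities(v1, v2, e_field):
    """Drop charge from the two drift velocities (no Stokes-law correction)."""
    return (4.0 * math.pi / 3.0 * (9.0 * MU_AIR / 2.0) ** 1.5
            * math.sqrt(1.0 / (G_ACC * (RHO_OIL - RHO_AIR)))
            * (v1 + v2) * math.sqrt(v1) / e_field)


RADIUS = 1.0e-6   # drop radius in m (assumed)
E_FIELD = 3.0e5   # field between the plates in V/m (assumed)
N_CHARGES = 3     # number of elementary charges on the drop (assumed)

v1 = fall_velocity(RADIUS)
v2 = rise_velocity(RADIUS, N_CHARGES * E_CHARGE, E_FIELD)
q = charge_from_velocities(v1, v2, E_FIELD)
print(f"v1 = {v1:.3e} m/s, v2 = {v2:.3e} m/s, q/e = {q / E_CHARGE:.6f}")
```

Note that the drop radius never has to be measured directly: it is eliminated through the fall velocity $v_1$, which is why the formula contains $v_1^{1/2}$.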


Focus Box 2.7: Millikan’s oil drop experiment to measure the electron charge

Figure 2.6 Schematic of the Millikan oil drop experiment [670].
(Credit: openstax.org, access for free at https://openstax.org/books/chemistry-2e/pages/1-introduction)


charge unit Coulomb, and the translation into the CGS units of 1911 involves 
the speed of light (also fixed in SI units)¹¹.

Needless to say, such an important measurement was subject to scrutiny
by competing methods. The Austrian Felix Ehrenhaft indeed claimed to
have found charges smaller than e and not in multiples of an elementary 
charge [131]. Millikan kept refining his method, especially measuring and 
parametrising deviations from Stokes’ law for small drop radii. In his final 
publication on the subject [152], he discussed the systematics of his measure- 
ment in depth. Among all his notebooks, he kept the one with these results; it 
is preserved at the Caltech library and can be consulted online¹². Millikan’s
result stood up to all challenges. He was awarded the 1923 Nobel prize for 
physics for this work and his research on the photoelectric effect (see Chap- 
ter 5). 

Strictly speaking, what was measured is the electric charge of ions, i.e. of
molecules stripped of a few of their electrons or with extra electrons attached.
The greatest common divisor of these charge measurements is thus equal


11 For a lively and interesting discussion of units for physical properties see the blog
blog.wolfram.com of Nov. 16, 2018.
12 caltechin.library.caltech.edu/7/1/Millikan_1.pdf

to the electron charge if and only if atoms are strictly neutral. Millikan of
course realised that this assumption needed experimental verification [174,
p. 80-83]. A few of his drops first had a negative charge; he then succeeded
in charging them positively by X-ray exposure (photoelectric effect, see
Section 4.3). By direct comparison he concluded that the common divisors of
positive and negative charges were the same to one part in 2000. This is indeed
confirmed today with staggering accuracy. Modern measurements of the
neutrality of atoms find that $|q_e - q_p|/e < 1.0 \times 10^{-21}$, by showing that an AC
excitation of a gas does not cause an acoustic resonance [584].


FURTHER READING 


Edward A. Davis and Isobel J. Falconer, J.J. Thomson and the Discovery of 
the Electron, Taylor and Francis, 1997. 


Jed Z. Buchwald and Andrew Warwick (Eds.), Histories of the Electron: The
Birth of Microphysics, MIT Press, 2001. 


Steven Weinberg, The Discovery of Subatomic Particles, Cambridge University 
Press, 2003. 




CHAPTER 3 


Relativity 


...not only in mechanics, but also in electrodynamics, no proper- 
ties of the phenomena correspond to the concept of absolute rest, but 
rather that for all coordinate systems for which the mechanical equa- 
tions hold, the equivalent electrodynamical and optical equations hold 
also ...In the following we will elevate this guess to a presupposition 
(whose content we shall subsequently call the “Principle of Relativ- 
ity”) and introduce the further assumption, -an assumption which is 
only apparently irreconcilable with the former one- that light in empty 
space always propagates with a velocity V which is independent of the 
state of motion of the emitting body. 


Albert Einstein, Zur Elektrodynamik bewegter Körper, 1905 [118] 


IN THIS CHAPTER I discuss the consequences of the fact that no absolute
space-time exists, but that all laws of physics should be the same in all reference
frames. Newton had indeed shown this to be the case for mechanics
in 1687 [1, Corollary 5], for reference systems moving relative to one another 
at constant velocity. These are called inertial systems since the inertial resis- 
tance of a body to an accelerating force will be the same, i.e. Newton’s law will 
hold the same way. The fact that only relative velocity matters is often called 
Galilean relativity since it is based on the homonymous transformations of 
coordinates (see Focus Box 3.4). Newton assumed that there was a “natural” 
coordinate system, absolute space and time, with respect to which all others 
would be defined, the ether. 

It was thus logical to assume that when Newton’s law holds in any inertial 
coordinate system, so should Maxwell’s laws. This is what Einstein calls the 
“principle of relativity” in the above quote. This cannot be achieved with- 
out modification, since Newton’s law requires that velocities simply add up 
vectorially, in obvious conflict with the speed of light being the same in all 
reference frames. Thus a light ray emitted from a fast train propagates with the
same speed when seen from the train and from the train station. We will see 
how Einstein modified classical mechanics to admit this, while preserving the 
principle of relativity. But beforehand I will review experimental evidence that 
both facts hold. As examples I describe three experiments: the one by Fizeau,
the well-known Michelson and Morley experiment and the one by Trouton and
Noble. I then describe special relativity, which refers to inertial systems. Gen- 
eral relativity, covering all coordinate systems, is also briefly covered, since it 
is necessary to understand cosmological facts detailed in Chapter 10. 


3.1 ETHER 


In 1851, Hippolyte Fizeau set up an experiment to measure the speed of light 
c in flowing water [19]. He used an interferometer to measure the difference in 
speed between light propagating in and against the direction of water flowing 
through a glass pipe with velocity v, as sketched in Figure 3.1. Taking into 
account that light propagates slower in a medium with refractive index n, 
he expected a modification according to the Galilean addition of velocities, 
$c/n \pm v$ in the two arms. Fizeau indeed observed a small shift of the interference


Figure 3.1 Sketch of the Fizeau experiment to measure the speed of light in 
flowing water. The dotted light path always goes with the water flow, the 
dashed one always against it. Their interference pattern is observed on the 
right. If the speed of light c were influenced by the movement of the medium, 
the interference pattern would change when the water speed v is varied. Indeed 
it does, but only by a very small amount. 


pattern, but the velocity of the water only entered reduced by a large factor,
$c/n \pm v\,(1 - 1/n^2)$. This had been anticipated by Fresnel for a medium at rest
with respect to the ether [422]¹. In 1886, Albert A. Michelson and Edward
W. Morley confirmed the experiment with an improved apparatus [48]. The 
results clearly showed that the velocity of a medium does not simply add to 
the speed of light. 

Thirty years after Fizeau’s experiment, the American physicist Albert A. 
Michelson set up an interferometer in Berlin, shown in Figure 3.2, to com- 
pare the speed of light in two orthogonal directions [45]. The idea behind this 
experiment was the following: the Earth moves around the Sun with a velocity 


1 The correct explanation for moving dielectrics was finally found by the Dutch physicist
Pieter Zeeman after the formulation of special relativity [162, 167].

Figure 3.2 Left: Interferometer of the 1881 Michelson experiment in Berlin and
Potsdam [45]. Right: Sturdy interferometer mounted on a stone slab floating 
on a mercury bed, used by Michelson and Morley in 1887 in Cleveland [51]. 
(Credit: American Journal of Science, reproduced by permission) 


of about 30 km/s. If a luminiferous ether were at rest with respect to anything
other than the Earth itself, comparing the speed of light in the direction
of the Earth’s motion and orthogonal to it would reveal a difference. Tiny differences
in speed are sensitively detected (or not!) by interferometry, as Fizeau had 
demonstrated. However, vibrations of the ground in busy Berlin disturbed 
the sensitive Michelson apparatus. Even moving it outside the city to Pots- 
dam did not solve the problem. A much more sturdy set-up, also shown in 
Figure 3.2, was thus mounted on a stone slab by Michelson and Edward W. 
Morley in Cleveland, Ohio, in 1887 [51]. Focus Box 3.1 explains how it works.
The experiment convincingly showed that there is no dependence of the speed 
of light on the direction in which it is emitted. Since the Earth constantly 
moves with respect to the Sun (which is not at rest with respect to other stars 
either), it is not conceivable that an ether transmits electromagnetic waves. 
Electromagnetic waves do not require a medium, they propagate in empty 
space. In 1930, Georg Joos repeated the experiment [238] with unprecedented 
precision and confirmed the result. It is thus clear that neither linear nor 
quadratic influences of motion on c exist. It is simply the same in any system 
of reference. 
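
To get a feeling for the size of the effect that Michelson and Morley were looking for, one can evaluate the classical arrival times numerically. The following is a minimal sketch, assuming the ether-theory round-trip times $L/(c+v) + L/(c-v)$ for the arm parallel to the motion and $2L/\sqrt{c^2 - v^2}$ for the orthogonal arm, and the resulting fringe shift $n \simeq (2L/\lambda)(v^2/c^2)$ after rotating the apparatus by 90°. The arm length and the wavelength are illustrative assumptions.

```python
import math

C_LIGHT = 299_792_458.0  # speed of light in m/s
V_EARTH = 3.0e4          # orbital velocity of the Earth in m/s (about 30 km/s)
ARM_LENGTH = 11.0        # effective arm length L in m (assumed)
WAVELENGTH = 550e-9      # wavelength of the light in m (assumed)


def arrival_times(length, v, c=C_LIGHT):
    """Classical round-trip times for the parallel and the orthogonal arm."""
    t_parallel = length / (c + v) + length / (c - v)
    t_orthogonal = 2.0 * length / math.sqrt(c**2 - v**2)
    return t_parallel, t_orthogonal


def fringe_shift_exact(length, wavelength, v, c=C_LIGHT):
    """Shift in units of wavelength; the rotation doubles the time difference."""
    t_parallel, t_orthogonal = arrival_times(length, v, c)
    delta = t_parallel - t_orthogonal
    return c / wavelength * 2.0 * delta


def fringe_shift_approx(length, wavelength, v, c=C_LIGHT):
    """Leading-order expression n = (2 L / lambda) (v^2 / c^2)."""
    return 2.0 * length / wavelength * v**2 / c**2


n_exact = fringe_shift_exact(ARM_LENGTH, WAVELENGTH, V_EARTH)
n_approx = fringe_shift_approx(ARM_LENGTH, WAVELENGTH, V_EARTH)
print(f"expected fringe shift: exact {n_exact:.4f}, approximate {n_approx:.4f}")
```

With these assumed values the classical prediction is a sizeable fraction of a fringe spacing, which is why the null result was so convincing.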


3.2 MOVING FRAMES 


We had noted in Chapter 2 that the split into electric and magnetic fields 
cannot be independent of the reference frame. Charges must be moving to 
make a current which causes a magnetic field. The current is proportional to 
both their spatial density and their velocity, so is the magnetic field. To feel 
a force from this magnetic field, the probe must also be moving. Again, the 
force is proportional to the probe velocity. When we fix our reference frame 
to the moving probe, magnetic forces magically disappear. What happens to 
the principle of relativity?

The sketch on the left shows the principle of Michelson’s interferometer. The light
from the source partially passes the semi-transparent mirror onto the movable
mirror where it is reflected. On its way back it is again (partially) reflected
towards the observer. The other part of the source light is reflected into the
right arm of the interferometer, where it meets the fixed mirror and passes back
towards the observer.


The classical theorem of Galilean relativity predicts that two velocities add 
up vectorially. If the Earth moves parallel to the right arm, through an ether 
(assumed to be at rest) with velocity v, the speed of light c will be enhanced on 
the way towards the fixed mirror and reduced on the way back to the observer. 
For the upper arm, the velocity of the interferometer will be orthogonal to the 
light path, such that the speed of the light ray will be c, but the path slightly 
increased by the relative motion. The times of arrival of the two beams with 
respect to an arbitrary time of emission will thus be: 


$$t_1 = \frac{L}{c+v} + \frac{L}{c-v} = \frac{2L}{c}\,\frac{1}{1 - v^2/c^2} \quad ; \quad t_2 = \frac{2L}{\sqrt{c^2 - v^2}} = \frac{2L}{c}\,\frac{1}{\sqrt{1 - v^2/c^2}}$$

$L$ is the length of both arms, adjusted to be the same. The time difference
between the two arms for this direction of motion will thus be:

$$\Delta_1 = t_1 - t_2 = \frac{2L}{c}\left(\frac{1}{1 - v^2/c^2} - \frac{1}{\sqrt{1 - v^2/c^2}}\right)$$

Now we rotate the interferometer by 90°; the upper arm will now be in the
direction of motion of the Earth, the right arm will be orthogonal to it.
Consequently, the two terms in the bracket will be interchanged, so that the time
difference becomes $\Delta_2 = -\Delta_1$. The rotation will thus change the phase
between the two light rays by a time difference:

$$\Delta_1 - \Delta_2 = \frac{4L}{c}\left(\frac{1}{1 - v^2/c^2} - \frac{1}{\sqrt{1 - v^2/c^2}}\right) \simeq \frac{2L}{c}\,\frac{v^2}{c^2}$$

The approximation is valid in the limit of $v \ll c$, which is certainly the case
since Earth moves inside the solar system with a velocity of about $v \simeq 30$ km/s,
compared to the speed of light $c \simeq 300{,}000$ km/s. The corresponding phase
shift in units of wavelength will thus be $n = (c/\lambda)(\Delta_1 - \Delta_2) \simeq (2L/\lambda)(v^2/c^2)$.
This is the fraction of a line spacing that the shift should amount to. None is
observed.


Focus Box 3.1: The experiments of Michelson and Morley

A demonstration of the principle of relativity in electrodynamics is already
included in Faraday’s experiments on induction [511]. Faraday analysed three 
cases: 


• A conductor is moved through the inhomogeneous field of a magnet at
rest.


• The magnet and its field are moved through the conductor at rest.


• With both magnet and conductor at rest, the magnetic field is varied in
time.


In each case, the voltage or current induced in the conductor is measured, 
thus the induced electric field. What happens is explained in modern terms in 
Focus Box 3.2. The three cases are found to be qualitatively and quantitatively 
equivalent; what matters is the relative motion, leading to a variation of the
magnetic flux in time as in the third case. In other words: Faraday’s law holds 
(see Focus Box 2.3). 

When the conductor is at rest, the charges in it do not move. How can they 
be sensitive to a magnetic field? The answer is that the changing magnetic 
field now looks to them like an electric one, as shown in Focus Box 3.2. The 
relative role of electric and magnetic field depends on the reference frame, but 
the force that acts remains the same. 

Trouton and Noble searched for effects of this transformation from electric 
to magnetic fields in 1903 [112, 111], as explained in Focus Box 3.3. Classically, 
a dipole should feel a torque when moving at a constant velocity and turn 
until it is orthogonal to the direction of motion. No such torque is observed, 
in agreement with Einstein’s first axiom, the principle of relativity. 

The negative results of these two experiments, among others, led Hendrik 
A. Lorentz in 1904 [114] to derive a set of transformations from one inertial 
system to another that would accommodate these facts. He came up with the 
spectacular result that moving length scales would be contracted when seen 
from an observer at rest. Likewise, time would depend on the relative motion 
of two systems. We detail the mathematics of these Lorentz transformations 
in Focus Box 3.5. They deform the electric field of a moving electron. The 
effects are of second order in v/c, the ratio of the relative speed of the two 
systems to the speed of light, and explain the absence of effects in Trouton 
and Noble’s experiment. Lorentz still believed, however, that there was an 
absolute space-time, which defines the “right” lengths and times. 

Lorentz’ work was generally taken as a theory of the electron, with his 
results on the deformation of the field considered as due to a change of its 
shape. It was generalised to gravitational properties of the electron by Henri 
Poincaré in 1905 [121], who concluded that also the gravitational field would 
propagate at the speed of light, in the form of gravitational waves. 




We consider the first two cases of Faraday’s induction experiments, once fixing
our coordinate frame to the magnet, $(\vec r, t)$, and then to the conductor, $(\vec r', t')$.

Magnet frame: In the rest frame of the magnet, the magnetic field is
inhomogeneous but fixed, $\vec B(\vec r)$. The electric field is zero. According to Lorentz’
law (see Focus Box 2.5), the force on a particle of charge $q$ and velocity $\vec v$ (as
given by the motion of the conductor) is $\vec F = q\,\vec v \times \vec B$.

Conductor frame: In the conductor frame, there is a time-varying magnetic
field $\vec B'$ related to the (fixed) magnetic field $\vec B$ in the magnet frame:
$\vec B'(\vec r', t') = \vec B(\vec r' + \vec v t')$. In this frame, there is an electric field:
$\vec\nabla \times \vec E' = -\partial \vec B'/\partial t'$. We thus find that $\vec E' = \vec v \times \vec B$. A charge $q$ in the
conductor is at rest in the conductor frame, thus $\vec F' = q\vec E' = q\,\vec v \times \vec B = \vec F$.

To first order, the force is thus the same in both frames, so are induced voltage
and current. However the force is seen to be electric in the conductor frame,
magnetic in the magnet frame.

When using Lorentz transformations between the two reference frames, effects
of second order in $v/c$ are revealed. For the conductor frame we find
$\vec E' = \gamma\,\vec v \times \vec B$, $\vec F' = q\vec E' = q\gamma\,\vec v \times \vec B$, with $\gamma = 1/\sqrt{1 - v^2/c^2}$ and the speed of light
$c$. The relativity principle is thus preserved in that Maxwell’s equations have
the same form in both frames.


Focus Box 3.2: Relativity in induction 
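
The first-order equality of the force in the two frames, and the smallness of the second-order correction, can be made concrete with a few lines of code. This is only a sketch under stated assumptions: the conductor velocity and the magnetic field are illustrative values, the field is taken as uniform at the position of the charge, and the helper functions `cross` and `scale` are hypothetical names introduced for this example.

```python
C_LIGHT = 299_792_458.0     # speed of light in m/s
Q = 1.602176634e-19         # probe charge in C (one elementary charge)
V = (10.0, 0.0, 0.0)        # conductor velocity in m/s (assumed)
B = (0.0, 0.0, 0.5)         # magnetic field at the charge in T (assumed)


def cross(a, b):
    """Vector product a x b of two 3-vectors given as tuples."""
    return (a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0])


def scale(s, a):
    """Multiply the 3-vector a by the scalar s."""
    return tuple(s * x for x in a)


# Magnet frame: purely magnetic force F = q v x B on the moving charge.
force_magnet_frame = scale(Q, cross(V, B))

# Conductor frame (first order): the charge is at rest and feels E' = v x B.
e_prime = cross(V, B)
force_conductor_frame = scale(Q, e_prime)

# Leading term of gamma - 1, i.e. the relative size of the second-order effect.
speed_squared = sum(x * x for x in V)
relative_correction = 0.5 * speed_squared / C_LIGHT**2

print("force, magnet frame:   ", force_magnet_frame)
print("force, conductor frame:", force_conductor_frame)
print(f"relative second-order correction: {relative_correction:.2e}")
```

For laboratory velocities the relativistic correction is far below any measurable level, which is consistent with Faraday finding the cases quantitatively equivalent.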


3.3 SPECIAL RELATIVITY 


The concept of absolute space and time was radically abandoned in Albert 
Einstein’s June 1905 paper “On the electrodynamics of moving bodies” [118], 
as seen in the introductory quote to this chapter. Einstein motivates his study
by the facts we mention above. But his paper does not have a single reference, 
since “it is known that Maxwell’s electrodynamics —as usually understood at 
the present time— when applied to moving bodies, leads to asymmetries which 
do not appear to be inherent in the phenomena.” Instead “the unsuccessful 
attempts to discover any motion of the earth relatively to the ‘light medium’ 
suggest that the phenomena of electrodynamics as well as of mechanics possess 
no properties corresponding to the idea of absolute rest.” Einstein traces back 
the problems to a lack of understanding for simultaneous processes. What 
is to be understood by at the same time when one deals with two different 
coordinate systems in relative motion? How does one attribute a time to an 
event elsewhere when absolute time and space do not exist? 

The answer is that one needs a prescription for the synchronisation of two 
clocks. Let us assume, like Einstein does in Section 2 of his paper, that there 
are two identical clocks in two places A and B. We now emit a light signal from
point A at time $t_A$ (measured by the A clock). It is reflected from point B at
$t_B$ (measured by the B clock) and arrives back at A at time $t'_A$ (measured by
the A clock). The two clocks are synchronous if $t_B - t_A = t'_A - t_B$. In addition,
the speed of light defines the relative units of space and time, because twice

To understand the idea of the experiment conducted by Trouton and Noble,
we consider an electric dipole, which moves with a constant velocity $v$ in the
horizontal direction with respect to the laboratory system. The charge $+q$
at its upper end causes a current $I$ in the laboratory frame, which does not
exist in the rest frame of the dipole. The magnetic field caused by the current
results in a force $F$ pointing vertically downward, and felt by the lower charge.
Likewise, the lower charge causes a current flowing in the opposite direction;
the corresponding magnetic field pulls the upper charge upward. In the
laboratory system, the dipole should thus feel a torque aligning it orthogonal
to its direction of motion. No such torque is observed.
A magnetic torque also exists when treating the prob- 
lem relativistically. However, a second force is to be 
taken into account, which has no effect in the non- 
relativistic limit. As the sketch on the left shows, the 
Coulomb force K in the rest frame of the dipole acts 
in the direction of its axis and has no effect on its 
orientation. 

Seen from the laboratory system, however, the vertical coordinates do not 
change, but the horizontal ones are contracted. The electrostatic force is thus 
no longer parallel to the dipole axis and compensates the magnetic forces. 
There is thus no resulting torque, in agreement with observation. 




Focus Box 3.3: The Trouton and Noble experiment 


According to Newtonian mechanics, the transformation from a system of
coordinates at rest to a moving system is given by the Galilean transformation.
When the system $(x', y', z')$ moves at constant velocity $v$ in the $+x$ direction of
system $(x, y, z)$ and the origins of the two coincide for $t = t' = 0$ we have:

$$t = t' \quad ; \quad x = x' + \beta c t' \quad ; \quad y = y' \quad ; \quad z = z'$$

with $\beta = v/c$.

This leaves lengths $\Delta\vec r = \vec r_1 - \vec r_2$ and elapsed times $\Delta t = t_1 - t_2$ invariant, i.e.
$\Delta\vec r = \Delta\vec r'$, $\Delta t = \Delta t'$. Velocities add up vectorially, such that the velocity $\vec u'$
of a mass point in the moving system will be measured in the system at rest
as $\vec u$:

$$u_x = u'_x + v \quad ; \quad u_y = u'_y \quad ; \quad u_z = u'_z$$

Momentum $\vec p = m\vec u$ and kinetic energy $E = mu^2/2$ measured in the two
systems behave accordingly.


Focus Box 3.4: Galilean transformations

According to Einstein’s special relativity, the transformation from a system of
coordinates $(x, y, z)$ at rest to a system $(x', y', z')$ moving at constant velocity
$v$ in the $x$ direction (like in Focus Box 3.4) is given by the Lorentz
transformation:

$$ct = \gamma\,(ct' + \beta x') \quad ; \quad x = \gamma\,(x' + \beta c t') \quad ; \quad y = y' \quad ; \quad z = z'$$

with $\beta = v/c$ and $\gamma = 1/\sqrt{1 - v^2/c^2}$. When a light ray connects two events,
$x_1 = (ct_1, \vec r_1)$ and $x_2 = (ct_2, \vec r_2)$, it travels at the same speed $c$ in any system:

$$\left((x_1 - x_2)^2 + (y_1 - y_2)^2 + (z_1 - z_2)^2\right)^{1/2} = c\,(t_1 - t_2)$$

This transformation thus leaves the space-time length $\sqrt{s}$, defined by
$s = c^2t^2 - \vec r^2$, invariant. Time and space thus form a four-vector (see Focus
Box 3.6). Lorentz transformations are the rotations and translations in
space-time, which leave $s$ invariant. Since $s$ is to be positive definite, all velocities
have to be smaller than the speed of light. Lengths and elapsed times, on the
contrary, are not invariant separately. Instead we have time dilatation and
length contraction in the direction of relative motion:

$$\Delta t = \gamma\,\Delta t' \quad ; \quad \Delta x = \frac{1}{\gamma}\,\Delta x' \quad ; \quad \Delta y = \Delta y' \quad ; \quad \Delta z = \Delta z'$$

Accordingly, velocities no longer add up simply. Instead we have:

$$u_x = \frac{u'_x + v}{1 + u'_x v/c^2} \quad ; \quad u_y = \frac{u'_y}{\gamma\,(1 + u'_x v/c^2)} \quad ; \quad u_z = \frac{u'_z}{\gamma\,(1 + u'_x v/c^2)}$$

Another important four-vector is energy-momentum, $(E, c\vec p)$. Its length
gives the mass $m$ of the corresponding object: $m^2c^4 = E^2 - c^2\vec p^2$. For an object
at rest in a given system ($\vec p = 0$), we find back Einstein’s famous formula for
the equivalence of mass and energy, $E = mc^2$.


Focus Box 3.5: Lorentz transformations

the distance between A and B, $2r_{AB}$, is traversed by the light ray in the
period $t'_A - t_A$, such that $2r_{AB}/(t'_A - t_A) = c$, a universal constant. When the
second clock is moving, this leads to a loss of synchronism, unless redefined
by light rays.

With this definition of synchronism, Einstein derives the Lorentz trans- 
formations between two inertial systems in Section 3, without reference to 
Lorentz’ work. Lengths are defined by rulers at rest in any given system.
When one needs to know them in a moving system, they must be defined
by synchronous clocks at both ends. In summary, lengths in a moving system
appear contracted in a system at rest. Times measured in a moving system
appear longer than measured by a clock at rest². Of course the roles of “the
moving system” and the “system at rest” can be readily reversed, since there 
is no absolute rest. Thus symmetry is restored, only relative motion matters. 
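
The difference between Galilean and relativistic addition of velocities can be made tangible numerically. The following is a minimal sketch for motion along a single axis, assuming the relativistic rule $u = (u' + v)/(1 + u'v/c^2)$. It checks two things: light emitted from a fast train still moves at $c$ for the observer on the platform, and light in slowly flowing water picks up only the reduced velocity $v(1 - 1/n^2)$ seen by Fizeau, to first order in $v/c$. The refractive index and the water speed are illustrative assumptions.

```python
C_LIGHT = 299_792_458.0  # speed of light in m/s


def galilean_add(u_prime, v):
    """Classical addition of collinear velocities."""
    return u_prime + v


def lorentz_add(u_prime, v, c=C_LIGHT):
    """Relativistic addition of collinear velocities."""
    return (u_prime + v) / (1.0 + u_prime * v / c**2)


# Light emitted from a train moving at half the speed of light.
light_from_train = lorentz_add(C_LIGHT, 0.5 * C_LIGHT)

# Light in flowing water, as in Fizeau's experiment.
N_WATER = 1.33  # refractive index of water (assumed)
V_WATER = 7.0   # flow velocity in m/s (assumed)
u_galilean = galilean_add(C_LIGHT / N_WATER, V_WATER)
u_lorentz = lorentz_add(C_LIGHT / N_WATER, V_WATER)
u_fresnel = C_LIGHT / N_WATER + V_WATER * (1.0 - 1.0 / N_WATER**2)

print(f"light from train: {light_from_train / C_LIGHT:.12f} c")
print(f"water, Galilean minus relativistic: {u_galilean - u_lorentz:.4f} m/s")
print(f"water, relativistic minus Fresnel:  {u_lorentz - u_fresnel:.2e} m/s")
```

The last line prints a difference far below the flow velocity itself: the remaining discrepancy between the relativistic result and Fresnel's coefficient is of second order in $v/c$.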

In the second part of his paper, Einstein applies these findings to electro- 
magnetism and derives the transformation properties of electric and magnetic 
fields. He concludes his paper by looking into the consequences of his findings 
for the energy of a moving body and its resistance to acceleration, its iner- 
tia. This is treated more in-depth in a short paper submitted in September 
1905 [117]. There he considers the energy balance of a moving body which 
emits light of energy E. His result is that indeed its inertial mass reduces by 
$\Delta m = E/c^2$. This is the first example of the famous equivalence of mass and
energy, $E = mc^2$.

The theory of special relativity found almost immediate support from
leading authorities like Max Planck. Especially Hermann Minkowski [128,
165] strongly emphasised its disruptive character and introduced the
four-vector notation of relativistic kinematics that we use throughout this book
and explain in Focus Box 3.6. It allows for a particularly elegant and compact
representation of Maxwell’s equations, which is quoted in Focus Box 3.7.
Today, special relativity is used and confirmed in every experiment using par- 
ticles travelling at a velocity close enough to the speed of light. A popular 
example is the fact that unstable particles created by cosmic rays high up in 
the atmosphere reach Earth despite their short lifetimes [295], due to time 
dilatation. 
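
The cosmic ray example can be quantified with a short calculation. This is a sketch under stated assumptions: it takes the muon as the unstable particle (the text above does not name one), uses its mean lifetime at rest of about 2.2 microseconds, and assumes an illustrative velocity and production altitude. Energy loss in the atmosphere is ignored.

```python
import math

C_LIGHT = 299_792_458.0  # speed of light in m/s
TAU_MUON = 2.197e-6      # mean lifetime of a muon at rest in s
BETA = 0.995             # muon velocity in units of c (assumed)
ALTITUDE = 15_000.0      # production altitude in m (assumed)


def lorentz_gamma(beta):
    """Lorentz factor for a velocity beta = v/c."""
    return 1.0 / math.sqrt(1.0 - beta**2)


def mean_decay_length(beta, tau, dilated=True):
    """Mean distance travelled before decay, with or without time dilatation."""
    gamma = lorentz_gamma(beta) if dilated else 1.0
    return gamma * beta * C_LIGHT * tau


def survival_fraction(distance, decay_length):
    """Fraction of particles surviving a given distance."""
    return math.exp(-distance / decay_length)


gamma = lorentz_gamma(BETA)
length_classical = mean_decay_length(BETA, TAU_MUON, dilated=False)
length_dilated = mean_decay_length(BETA, TAU_MUON, dilated=True)
surviving_classical = survival_fraction(ALTITUDE, length_classical)
surviving_dilated = survival_fraction(ALTITUDE, length_dilated)

print(f"gamma = {gamma:.2f}")
print(f"mean decay length: {length_classical:.0f} m without, "
      f"{length_dilated:.0f} m with time dilatation")
print(f"surviving fraction: {surviving_classical:.1e} without, "
      f"{surviving_dilated:.1e} with time dilatation")
```

Without time dilatation practically no such particle would reach the ground; with it, a sizeable fraction does.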

Special relativity connects inertial systems, those on which no forces act. 
But do such systems actually exist? Earth is constantly accelerated by the 
Sun’s gravitational force such that it stays on its elliptical path, so no coor- 
dinate system fixed on Earth is really inertial. The Sun is also rotating and 
attracted by other stars in the Milky Way, so it cannot hold an inertial system 
either. Since the range of the gravitational force is infinite, there is indeed no 
object in the whole Universe on which it does not act. So Einstein asks a valid 
question [280, p. 223]: “Can we formulate physical laws so that they are valid 
for all coordinate systems, not only those moving uniformly, but also those 


2 Please keep in mind that we measure elapsed time by counting “ticks” of a clock. When
the second is shorter, we measure a longer time.

Covariant four-vectors transform like the space-time four-vector $x_\mu$ under
Lorentz transformations:

$$x_\mu = (x_0, x_1, x_2, x_3) = (ct, x, y, z)$$

An important example is the energy-momentum four-vector $p_\mu$:

$$p_\mu = (p_0, p_1, p_2, p_3) = (E, cp_x, cp_y, cp_z)$$

Contravariant four-vectors $x^\mu$ are related to covariant ones by the metric
tensor $g^{\mu\nu}$:

$$x^\mu = g^{\mu\nu} x_\nu \quad ; \quad g^{\mu\nu} = \begin{pmatrix} 1 & 0 & 0 & 0 \\ 0 & -1 & 0 & 0 \\ 0 & 0 & -1 & 0 \\ 0 & 0 & 0 & -1 \end{pmatrix}$$

$$x^\mu = (x^0, x^1, x^2, x^3) = (ct, -x, -y, -z)$$

$$p^\mu = (p^0, p^1, p^2, p^3) = (E, -cp_x, -cp_y, -cp_z)$$

In Minkowski space, the scalar product between two four-vectors is:

$$x_\mu x^\mu = \sum_\mu x_\mu x^\mu = c^2t^2 - \vec r^2$$

$$p_\mu p^\mu = E^2 - c^2p_x^2 - c^2p_y^2 - c^2p_z^2 = E^2 - c^2\vec p^2 = m^2c^4$$

$$p_\mu x^\mu = Ect - cp_x x - cp_y y - cp_z z = cEt - c\,\vec p \cdot \vec r$$

The first equation defines squared length, the second squared mass (see Focus
Box 3.5). The third product appears e.g. in equations describing the
evolution of particle systems. We use the convention that when upper and lower
Greek indices are the same, summation is implicit. Scalars are invariant under
Lorentz transformations, they stay the same when changing from one inertial
frame to another.
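
The invariance of such scalar products can be verified numerically. The following is a minimal sketch, assuming an energy-momentum four-vector $(E, cp_x, cp_y, cp_z)$ expressed in GeV and a boost along the $x$ axis of the same form as the Lorentz transformation of the space-time coordinates. The mass value (roughly that of a proton), the momentum components and the boost velocity are illustrative assumptions.

```python
import math


def minkowski_dot(a, b):
    """Scalar product a0*b0 - a1*b1 - a2*b2 - a3*b3 of two four-vectors."""
    return a[0] * b[0] - a[1] * b[1] - a[2] * b[2] - a[3] * b[3]


def boost_x(four_vector, beta):
    """Lorentz boost along x with relative velocity beta = v/c."""
    gamma = 1.0 / math.sqrt(1.0 - beta**2)
    t, x, y, z = four_vector
    return (gamma * (t + beta * x), gamma * (x + beta * t), y, z)


MASS = 0.938          # m c^2 in GeV (assumed, roughly a proton)
CP = (3.0, 0.0, 4.0)  # momentum components c p in GeV (assumed)
BETA = 0.6            # boost velocity in units of c (assumed)

energy = math.sqrt(MASS**2 + sum(c * c for c in CP))
p_lab = (energy, *CP)
p_boosted = boost_x(p_lab, BETA)

mass_lab = math.sqrt(minkowski_dot(p_lab, p_lab))
mass_boosted = math.sqrt(minkowski_dot(p_boosted, p_boosted))

print(f"E = {p_lab[0]:.3f} GeV before, {p_boosted[0]:.3f} GeV after the boost")
print(f"m c^2 = {mass_lab:.6f} GeV before, {mass_boosted:.6f} GeV after the boost")
```

Energy and momentum change under the boost, while the mass computed from $E^2 - c^2\vec p^2$ stays the same, as expected for a scalar.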


Focus Box 3.6: Minkowski’s four-vectors

The electromagnetic charge and current density as well as the electromagnetic
potentials form four-vectors, transforming like space-time under Lorentz
transformations:

$$j_\mu = (c\rho, \vec j) \quad ; \quad A_\mu = (V/c, \vec A)$$

The time-like element of $j_\mu$ is the charge density $\rho$, i.e. the electric charge
per unit volume. Its space-like components are the current density $\vec j$, i.e. the
electric charge per unit time flowing through a unit surface. Maxwell’s laws
(see Focus Box 2.3) connect it to the electromagnetic four-vector potential $A_\mu$.
Its time-like component is the scalar electric potential $V$, with $\vec E = -\vec\nabla V - \partial\vec A/\partial t$;
its space-like component the magnetic vector potential $\vec A$, with $\vec B = \vec\nabla \times \vec A$.
Introducing the electromagnetic field tensor $F_{\mu\nu}$ and using the four-gradient
$\partial_\mu = (\partial_t/c, \vec\nabla)$:

$$F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu = \begin{pmatrix} 0 & E_x/c & E_y/c & E_z/c \\ -E_x/c & 0 & -B_z & B_y \\ -E_y/c & B_z & 0 & -B_x \\ -E_z/c & -B_y & B_x & 0 \end{pmatrix}$$

one can condense the four Maxwell equations into one:

$$\partial^\mu F_{\mu\nu} = \mu_0\, j_\nu$$

In words: the local current density causes the field tensor to diverge.


Focus Box 3.7: Maxwell’s equations in Minkowski space 
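
As an illustration of how the field tensor of Focus Box 3.7 packages the electric and magnetic fields, here is a small sketch that builds the matrix from given field components. The function names and the field values are hypothetical. It also evaluates the contraction $F_{\mu\nu}F^{\mu\nu}$, which is not discussed in the text but is a standard Lorentz scalar equal to $2(\vec{B}^2 - \vec{E}^2/c^2)$.

```python
import math

C = 299_792_458.0                    # speed of light in m/s
METRIC = (1.0, -1.0, -1.0, -1.0)     # diagonal of the metric tensor

def field_tensor(E, B):
    """Covariant field tensor F_{mu nu} from E (V/m) and B (T), as in the box."""
    Ex, Ey, Ez = (component / C for component in E)
    Bx, By, Bz = B
    return [
        [0.0,  Ex,  Ey,  Ez],
        [-Ex, 0.0, -Bz,  By],
        [-Ey,  Bz, 0.0, -Bx],
        [-Ez, -By,  Bx, 0.0],
    ]

def invariant(F):
    """Contraction F_{mu nu} F^{mu nu}; indices are raised with the diagonal metric."""
    return sum(METRIC[m] * METRIC[n] * F[m][n] ** 2
               for m in range(4) for n in range(4))

# Hypothetical field values, chosen only for illustration
E_field = (3.0e5, 0.0, 0.0)   # V/m
B_field = (0.0, 2.0e-3, 0.0)  # T
F = field_tensor(E_field, B_field)
print(invariant(F))
```

The tensor is antisymmetric by construction, so it carries exactly six independent numbers: the three components of each field.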
moving quite arbitrarily, relative to each other?” Einstein himself gave the
answer: general relativity. 


3.4 EQUIVALENCE 


In Chapter 2 we have tacitly introduced an important symmetry between 
Newton’s law linking force and acceleration and Newton’s law of universal 
gravity: the mass m, denoting inertial mass in the first and gravitational mass 
in the second, is the same. It means in particular that the same weight of 
different materials shows the same inertia. This fact is far from trivial and 
Newton checked on it using a pendulum. He measured the period of its motion 
with identical weights of materials, like metals, glass, sand, salt, water and 
even wood or grain. He found no difference, thus establishing that the inertial 
mass of all these materials is the same. In fact the period of the pendulum,
$T = 2\pi\sqrt{L/g}$, depends only on its length if and only if inertial and gravitational
mass are the same. You are familiar with this result from your physics course 
at school, where a stone and a feather take the same time to fall through an 
evacuated glass tube. The masses of the two objects are vastly different, but 
so are the gravitational forces. Mass cancels, the acceleration g is the same, so 
is then the velocity v(t) and the total time the fall takes (see Focus Box 2.1). 
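
To see how a pendulum tests the equality of the two masses, one can keep them separate in the small-angle equation of motion, $m_i L \ddot{\theta} = -m_g g\,\theta$, which gives a period $2\pi\sqrt{m_i L/(m_g g)}$. The sketch below is an illustration under that assumption; the function name and the numerical values are hypothetical.

```python
import math

def pendulum_period(length_m, g=9.81, inertial_mass=1.0, gravitational_mass=1.0):
    """Small-angle period; reduces to 2*pi*sqrt(L/g) when both masses are equal."""
    return 2.0 * math.pi * math.sqrt(
        length_m * inertial_mass / (g * gravitational_mass))

period_equal = pendulum_period(1.0)
# Hypothetical material whose inertial mass exceeds its gravitational mass by 0.1%
period_unequal = pendulum_period(1.0, inertial_mass=1.001)

relative_shift = (period_unequal - period_equal) / period_equal
print(period_equal, relative_shift)
```

A mismatch between the masses would change the period by about half the fractional mismatch, which is what Newton looked for and did not find.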

Friedrich Bessel (better known for the periodic functions named after him) 
later improved on these experiments using his own version of the Kater pendu- 
lum [12]. Towards the end of the 19th century, the Hungarian physicist Roland 
Baron Eötvös found that modern technology warranted a measurement many 
orders of magnitude more precise [54]. He used the fact that in the northern 
hemisphere the centrifugal force due to the Earth’s rotation slightly deviates 
the direction of the gravitational force towards the south (see the left sketch 
in Figure 3.3). The centrifugal force is of dynamical origin, $F = m\omega^2 r$ with
the inertial mass $m$, the angular velocity $\omega$ of the rotation and the distance
r to the Earth’s rotational axis. In Budapest, where Eötvös conducted his 
experiments, it deviates about 5’56” from the vertical direction. If one orients 
the horizontal bar of a torsion balance along the east-west direction and uses 
different materials of the same weight on both ends, the balance feels a torque 
if a difference in inertial mass exists. Using a mirror to project a light ray over 
a large distance, a very sensitive angular measurement can be achieved. The 
torsion balance Eötvös used is shown on the right in Figure 3.3. He estimated 
its angular accuracy to be better than 1.6 x 10~° arc seconds. Eötvös found
no difference when comparing the inertial mass of materials as different in 
density as brass and cork. He concluded that inertial and gravitational mass 
are the same to one part in 20 million. 
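
The size of the deviation quoted for Budapest can be estimated from the centrifugal acceleration $\omega^2 r$. The following sketch assumes a spherical Earth, a latitude of 47.5° north for Budapest and a purely gravitational acceleration of 9.82 m/s²; none of these numbers is given in the text, and the function name is hypothetical.

```python
import math

OMEGA = 7.292e-5     # Earth's angular velocity in rad/s
R_EARTH = 6.371e6    # mean Earth radius in m (spherical approximation)
G_GRAV = 9.82        # assumed purely gravitational acceleration in m/s^2

def plumb_line_deviation_arcsec(latitude_deg):
    """Angle between the gravitational force and the plumb line, in arc seconds."""
    phi = math.radians(latitude_deg)
    a_c = OMEGA ** 2 * R_EARTH * math.cos(phi)   # centrifugal acceleration, r = R cos(phi)
    horizontal = a_c * math.sin(phi)             # component towards the equator
    vertical = G_GRAV - a_c * math.cos(phi)      # remaining vertical component
    return math.degrees(math.atan2(horizontal, vertical)) * 3600.0

deviation = plumb_line_deviation_arcsec(47.5)
print(deviation)
```

With these assumptions the result is close to six arc minutes, in line with the value of 5’56” quoted above.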

That inertial and gravitational mass are the same to this stunning accuracy 
must be more than a coincidence. Following this line of thought Einstein was 
led to a completely novel theory of the gravitational force. 
Figure 3.3 Left: The vertical gravitational force $F_G$ is slightly deviated towards
the South by the centrifugal force $F_C$ caused by the Earth’s rotation. Right: Cut
through a torsion balance used by Eötvös in his experiment in 1898 (Credit: 
Springer Nature [228], reprinted by permission). The probe is located in the 
vertical tube. The torsion angle is read via a mirror on the axis. 


3.5 GRAVITY 


It is thus impossible to distinguish a system with no gravity from a system in 
free fall inside a homogeneous gravitational field. When you are in free fall, you 
do not feel your own weight. This is what we call “weightlessness” in space, like 
on board the International Space Station: it is in free fall around the Earth, and
its orbit is maintained by a (nearly constant) horizontal velocity. Its inhabitants
thus float freely inside the ISS as a reference system³. Einstein [280, p.
228] called such a system a “pocket edition” of an inertial system, since it is 
limited in space and time. The inability to tell the difference between the two 
cases, no gravity or free fall, is called the equivalence principle. 
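
The horizontal velocity needed to keep a station in free fall around the Earth follows from equating the gravitational and centripetal accelerations on a circular orbit, $v = \sqrt{GM/r}$. A rough sketch, assuming a circular orbit at 400 km altitude and a spherical Earth (the constant and function names are illustrative):

```python
import math

GM_EARTH = 3.986004e14   # gravitational parameter of the Earth in m^3/s^2
R_EARTH = 6.371e6        # mean Earth radius in m

def circular_orbit(altitude_m):
    """Return (speed in m/s, period in s) of a circular orbit at the given altitude."""
    r = R_EARTH + altitude_m
    speed = math.sqrt(GM_EARTH / r)
    period = 2.0 * math.pi * r / speed
    return speed, period

iss_speed, iss_period = circular_orbit(400e3)
print(iss_speed, iss_period / 60.0)
```

This gives a speed of roughly 7.7 km/s and one orbit in about an hour and a half.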

With this in mind, Einstein developed a generalisation of relativity from 
the special case of inertial systems to any system of reference, i.e. general 
relativity as the theory of space-time and gravity. Three conditions had to be 
fulfilled by the new theory: 


1. The gravitational theory should be applicable to any reference system. 
In the absence of gravity, special relativity had to be included. 


2. The gravitational laws should be structural ones. Just as Maxwell’s elec- 
tromagnetic theory surpassed Coulomb’s and Ampére’s laws by describ- 
ing the spatial and temporal change of the electromagnetic field, so 


3Its inhabitants do feel a very small residual of the gravitational force, called micrograv- 
ity. It is caused by horizontal deceleration due to the very much diluted atmosphere at 400 
km altitude. 
should the new theory surpass Newton’s law of gravity by describing
the change of the gravitational field by a set of partial differential equa- 
tions. 


3. The geometry of space should not be Euclidean, its structure should be 
determined by masses and their velocities. Indeed [280, p. 251]: “The 
gravitational equations of the general relativity theory [should] try to 
disclose the geometrical properties of our world.” 


Following these lines, Einstein painstakingly arrived at his revolutionary the- 
ory of general relativity during the years 1907 to 1915. General relativity is a 
metric theory at heart, a theory that describes the generation and propagation 
of gravitational fields by a deformation of Euclidean space to a curved space. 
So after replacing absolute time by a prescription for the synchronisation of 
clocks, Einstein found that space was not what we naively think. Instead of 
moving along straight lines, bodies in free fall follow geodesics, the shortest 
paths between two points in a curved geometry. 

In Newtonian gravity, mass is the source of the gravitational field (see 
Section 2.1). In general relativity this role is taken by the energy-momentum 
tensor. A tensor is a general concept in vector algebra, originally invented 
to describe the deformation of solids. Here it includes the densities of three
quantities (mass, energy and momentum) and expresses properties such as
stress, pressure and shear. It does not need reference to a given
coordinate system and is readily generalised to a curved space-time. An exam- 
ple is the Ricci tensor, which describes the change in volume of a cloud of mass 
points, which are initially at rest and then fall freely. Einstein’s theory relates 
these two tensors, the energy-momentum tensor and the Ricci tensor, to the 
metric tensor of space-time. The results are called the Einstein field equations. 
In a nutshell, space-time tells matter along which path to fall and matter tells
space-time how to curve⁴.

There are numerous experimental proofs for the validity of this concept. 
Most prominently, the frequency of photons is shifted towards the blue when 
they fall towards a massive body, towards the red when they are emitted 
away from it. This effect is known as gravitational time dilatation and means 
that processes in general run more slowly when close to a massive body than 
away from it. Precision measurements of this frequency shift have been made
by Robert V. Pound, a co-discoverer of nuclear magnetic resonance, and
collaborators [345, 347, 381], using recoil-free nuclear resonance absorption of
gamma rays (the Mössbauer effect) as frequency standard.
Gravitational time dilatation in the Earth’s field has been verified by atomic
clocks. It must be corrected for to achieve the impressive accuracy of the 
Global Positioning System (GPS) in your car and your smartphone. 
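
The order of magnitude of the gravitational correction for GPS clocks can be estimated in the weak-field approximation, where the fractional rate difference between two clocks is the difference of their Newtonian potentials divided by $c^2$. The sketch below assumes an orbital radius of 26,560 km for the satellites, a value not given in the text, and deliberately ignores the separate velocity effect of special relativity.

```python
GM_EARTH = 3.986004e14   # gravitational parameter of the Earth in m^3/s^2
C = 299_792_458.0        # speed of light in m/s
R_EARTH = 6.371e6        # mean Earth radius in m
R_GPS = 2.656e7          # assumed GPS orbital radius in m

def gravitational_rate_offset(r_low, r_high):
    """Fractional rate by which a clock at r_high runs faster than one at r_low."""
    return GM_EARTH / C ** 2 * (1.0 / r_low - 1.0 / r_high)

# Accumulated time difference over one day, in seconds
offset_per_day = gravitational_rate_offset(R_EARTH, R_GPS) * 86400.0
print(offset_per_day)
```

Under these assumptions the satellite clock gains a few tens of microseconds per day relative to a clock on the ground. Left uncorrected, a timing error of that size would correspond to a position error of kilometres, since light travels about 300 m per microsecond.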

Light follows a geodesic instead of a straight line. This fact was verified by 
observing that light from distant astronomical objects is deviated by the Sun, 
first by Arthur Eddington and collaborators [184] during the total eclipse of 


4This slogan and variants thereof are often attributed to John Archibald Wheeler [409].
May 29, 1919. Through this bending of light rays, massive astronomical objects
form gravitational lenses. Depending on their mass distribution, they can pro- 
duce multiple or characteristically deformed images of objects that lie behind 
them in the line of sight. Gravitational lensing is used in modern astronomy 
to look into the distribution of mass in galaxies and clusters, including mass 
that does not radiate, absorb or reflect light, which is called dark matter. We 
will come back to this mysterious form of matter in Chapter 10. 
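
For a light ray grazing a spherical mass $M$ at distance $b$ from its centre, general relativity predicts a deflection angle of $4GM/(c^2 b)$. This formula is not derived in the text; it is the standard weak-field result. A short sketch evaluating it for a ray passing the solar limb, with commonly quoted solar parameters as assumptions:

```python
import math

GM_SUN = 1.32712e20   # gravitational parameter of the Sun in m^3/s^2
R_SUN = 6.957e8       # solar radius in m
C = 299_792_458.0     # speed of light in m/s

def deflection_arcsec(gm, impact_parameter_m):
    """Weak-field light deflection angle 4GM/(c^2 b), converted to arc seconds."""
    angle_rad = 4.0 * gm / (C ** 2 * impact_parameter_m)
    return math.degrees(angle_rad) * 3600.0

solar_limb_deflection = deflection_arcsec(GM_SUN, R_SUN)
print(solar_limb_deflection)
```

The result is a little under two arc seconds, which shows how small an effect the 1919 eclipse expeditions had to measure.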

At the end of a massive star’s life, when its fuel runs out and the ratio 
of its mass to its radius becomes sufficiently large, gravitational collapse cre- 
ates an environment where gravity rules. General relativity then predicts the 
formation of what is called a black hole, a region where gravitational forces 
are so large that neither matter nor radiation can escape. This appears to be 
a ubiquitous phenomenon: all usual galaxies including the Milky Way have a
massive black hole in their centre, weighing a few million to a few billion solar 
masses. In current models of structure formation, black holes play a prominent 
role when galaxies and their clusters form. 
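
How large the ratio of mass to radius must become can be made concrete with the Schwarzschild radius $r_s = 2GM/c^2$, the radius of the region of no escape for a non-rotating mass. The formula is not given in the text; the sketch below simply evaluates it for the Sun and for a hypothetical central black hole of four million solar masses.

```python
GM_SUN = 1.32712e20   # gravitational parameter of the Sun in m^3/s^2
C = 299_792_458.0     # speed of light in m/s

def schwarzschild_radius_m(solar_masses):
    """Schwarzschild radius 2GM/c^2 in metres for a mass given in solar masses."""
    return 2.0 * GM_SUN * solar_masses / C ** 2

r_sun = schwarzschild_radius_m(1.0)
r_galactic = schwarzschild_radius_m(4.0e6)   # hypothetical galactic-centre black hole
print(r_sun, r_galactic)
```

The Sun would have to be compressed to a radius of about 3 km, compared to its actual radius of roughly 700,000 km.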

An early spectacular prediction of general relativity [169, 175] is the emis- 
sion of gravitational waves. These are ripples in the metric of space-time which 
are emitted by dynamic gravitationally bound systems. An early indication of 
their existence was the slowing of the emission frequency of pulsars, rotating 
neutron stars where the axis of rotation is misaligned with their magnetic 
axis. They emit electromagnetic radiation like the beacon of a lighthouse;
the period of light pulses arriving on Earth indicates the rotation frequency.
Their spin down is due to a loss of rotational energy, in the form of both elec- 
tromagnetic and gravitational waves. Observations agree quantitatively with 
the predictions of general relativity [604]. In 2016, the giant interferometers 
Advanced LIGO and Virgo reported the first observation of a gravitational 
wave, from the merger of two black holes [630], a triumph of general relativity
100 years after it was first published as a theory.

The fiftieth anniversary of the Apollo 11 moon landing in 2019 reminded 
me of other spectacular tests of general relativity by the Lunar Laser Ranging 
experiments. They use mirrors deposited on the lunar surface by the manned 
Apollo and robotic Lunokhod missions [513]. The timing of very strong laser
light, with a delay of about 2.5 s between emission on Earth and arrival of the
reflected signal, allows the distance of the Moon to be determined to millimetre
accuracy [578], even though only one in $10^{17}$ photons arrives back. It is found
that the Moon’s orbit agrees with general relativity within the accuracy of the
measurement [549]. Earth and Moon fall towards the Sun with equal acceleration,
confirming the equivalence principle [467]. And Newton’s gravitational
constant is indeed constant within $(2 \pm 7) \times 10^{-13}$ per year [571].
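
The timing requirements of lunar laser ranging follow from simple arithmetic on the round trip of the light pulse. A sketch, assuming a mean Earth-Moon distance of 384,400 km (a value not quoted in the text; the actual distance varies along the orbit):

```python
C = 299_792_458.0          # speed of light in m/s
MOON_DISTANCE = 3.844e8    # assumed mean Earth-Moon distance in m

def round_trip_time(distance_m):
    """Time for light to travel to a reflector and back, in seconds."""
    return 2.0 * distance_m / C

delay = round_trip_time(MOON_DISTANCE)
# Timing resolution needed to resolve a 1 mm change in distance
timing_for_1mm = round_trip_time(1.0e-3)
print(delay, timing_for_1mm)
```

The delay comes out at about 2.5 s, as stated above, while millimetre accuracy corresponds to a timing resolution of a few picoseconds.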

In practical terms, effects of gravity are negligible in the context of particle 
physics; this is why we do not include technical detail here. The gravitational 
force is many orders of magnitude weaker than the other forces at microscopic
distances. Indeed, we still do not know how gravity works at short
distances, since general relativity has resisted quantisation efforts for more than a
century. However, gravity is the dominating force at cosmological distances.
We will come back to important consequences of general relativity for the 
evolution of our Universe in Chapter 10. 
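
To put a number on "many orders of magnitude", one can compare the gravitational and electrostatic attraction between the electron and the proton in a hydrogen atom. Both forces fall off with the square of the distance, so their ratio does not depend on it. The sketch below uses rounded values of the constants, which are assumptions not quoted in the text.

```python
G_NEWTON = 6.6743e-11       # gravitational constant in m^3 kg^-1 s^-2
COULOMB_K = 8.98755e9       # Coulomb constant 1/(4 pi eps0) in N m^2 C^-2
E_CHARGE = 1.602177e-19     # elementary charge in C
M_ELECTRON = 9.1094e-31     # electron mass in kg
M_PROTON = 1.67262e-27      # proton mass in kg

# Ratio of gravitational to electrostatic force between electron and proton
force_ratio = (G_NEWTON * M_ELECTRON * M_PROTON) / (COULOMB_K * E_CHARGE ** 2)
print(force_ratio)
```

The ratio is of order $10^{-40}$, which is why gravity plays no role in the atomic and subatomic phenomena discussed in the following chapters.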
CHAPTER 4


Atoms and nuclei 


Shortly after electrons were discovered it was thought that atoms 
were like little solar systems, made up of a ... nucleus and electrons, 
which went around in “orbits,” much like the planets ...around the 
sun. If you think that’s the way atoms are, then you’re back in 1910. 


Richard P. Feynman, The Strange Theory of Light and Matter, 
1985 [457] 


You have noted by now that I do not care too much about chronology. I
rather care about the continuity or discontinuity of ideas as we follow our
reductionist narrative. In this chapter we look at how classical physics arrived
at the notion of atoms in the 18th and 19th centuries. And how Rutherford,
Geiger and Marsden discovered their substructure.

The uncertainty about the ultimate nature of matter is of course much
older than physics. Indian and Greek philosophers already disputed whether
matter was granular or continuous, made of atoms or made of symmetries. But
their theories were based on pure thinking, without much input from Nature
itself. So we jump ahead to the beginning of the scientific study of matter, in
physics and chemistry, in the 18th century.


4.1 ATOMISM 


For many, scientific atomism starts with Daniel Bernoulli’s book “Hydrody- 
namica” of 1738 [3]. The bulk of the book deals with the dynamics of fluids 
as the title says, based on the conservation of energy. This conservation law 
was then still far from established. In Chapter 10 of his book, Bernoulli intro- 
duced the kinetic theory of gases, describing them as made of microscopic 
rigid bodies in motion, which scatter elastically from one another and from 
the boundaries of the gas volume. He demonstrated that pressure is caused 
by their impacts on the boundaries, as we show in Focus Box 4.1. Thus heat 
was identified as the kinetic energy of the gas atoms or molecules. 
Important steps along the way to scientific atomism are due to chemists. 
Prominent among them, Antoine Lavoisier introduced quantitative measures 
to chemistry in the late 18th century [6] and came up with a novel nomenclature
for chemical elements [4]. Carefully weighing reactants and products of
chemical reactions in sealed glassware, he established the conservation of mass 
in chemical reactions. He coined the term “elements” for substances which 
cannot be broken down further by chemical reactions. Lavoisier was also a 
political reformer and philanthropist. On false accusations, he was beheaded 
during the French revolution. 

John Dalton, of modest origins and educated by Quakers, was appointed 
teacher of mathematics and natural philosophy in 1793 at the “New College” 
in Manchester, a dissenting academy not conforming to the Church of England. 
At the turn of the century he resigned and began a new career as a private 
tutor in Manchester, then a thriving city thanks to the textile industry. Dalton was
a versatile researcher interested in subjects from metrology to colour blindness. 
He was, however, often content with rough measurements, from which he drew 
bold conclusions. His important contribution to atomism is the law of multiple 
proportions in chemistry [8, 9]. Consider two substances which form more than 
one compound. If the same mass $M$ of the first element combines with masses
$m_1$ and $m_2$ of the second, then the law says that $m_1/m_2$ is a ratio of small
integers. A good example is given by the oxides of carbon, CO and CO2. If you know
that atoms form molecules, it is clear that oxygen masses needed for the two 
molecules have a ratio of 1:2. If you do not know that, the observation will lead 
you, like Dalton, to become an atomist. He estimated relative atomic weights 
based on this law, but got almost all numbers wrong because of his lack 
of experimental rigour. Nevertheless his conclusions survived more rigorous 
measurements. 
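
The law of multiple proportions can be illustrated with the two oxides of carbon. The sketch below uses modern atomic weights, which Dalton did not have, and computes the mass of oxygen that combines with a fixed mass of carbon in CO and in CO2; the function name is hypothetical.

```python
from fractions import Fraction

# Modern standard atomic weights (assumed values, rounded)
ATOMIC_WEIGHT = {"C": 12.011, "O": 15.999}

def oxygen_per_gram_carbon(oxygen_atoms_per_carbon):
    """Mass of oxygen in grams combining with one gram of carbon."""
    return oxygen_atoms_per_carbon * ATOMIC_WEIGHT["O"] / ATOMIC_WEIGHT["C"]

m1 = oxygen_per_gram_carbon(1)   # carbon monoxide, CO
m2 = oxygen_per_gram_carbon(2)   # carbon dioxide, CO2

# Express the ratio m1/m2 as a fraction with a small denominator
small_integer_ratio = Fraction(m1 / m2).limit_denominator(10)
print(m1, m2, small_integer_ratio)
```

The two oxygen masses are far from integers themselves; only their ratio is simple, and that is the observation that points to discrete atoms.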

In 1808, at the same time as the first volume of Dalton’s magnum opus
“A New System of Chemical Philosophy” appeared, Joseph Louis Gay-Lussac 
found a novel feature of the chemistry of gases. Not only did definite pro- 
portions of weight combine to form compounds, but also definite proportions 
of volume did [10]. The explanation of this fact came with another revolu- 
tion, initiated in 1811 by Conte Amedeo Avogadro [11], professor of physics 
at the University of Torino. His hypothesis that equal volumes of any gas at a 
given temperature and pressure always contain the same number of gas par- 
ticles exactly fit both Dalton’s law and Gay-Lussac’s findings. To distinguish 
between atoms and molecules, Avogadro adopted terms including “molécule 
intégrante” (today molecule of a compound, like CO2), “molécule constitu- 
ante” (molecule of an element, like O2), and “molécule élémentaire” (atom, 
like O)¹. Avogadro’s number, the number of molecules per mole of substance, is
fixed by the international system of units² to exactly $N_A = 6.02214076 \times 10^{23}$
mol$^{-1}$. The role of this constant in gas pressure is explained in Focus Box 4.1.

The discovery of Brownian motion by the botanist Robert Brown in 
1827 could have lent strong support to the atomist view, but passed almost 


1See: https://www.brittanica.com/biography/Amedeo-Avogadro
2https://physics.nist.gov/cgi-bin/cuu/Value?na
Gases where only elastic scattering among molecules and with the confining
boundaries occurs are called ideal gases. Noble gases are a particularly good
example. We calculate their pressure as a function of temperature according
to the kinetic theory. Imagine an average molecule bouncing off the container
wall with an orthogonal component $\bar{v}_x$ of its velocity. When the gas is in
equilibrium, half of the molecules will move with an average velocity component
$+\bar{v}_x$, the other half with $-\bar{v}_x$. Only the former will impact the right wall.

The number of molecules which can impact on the boundary surface $A$ within
a total time $t$ is the number moving right in a volume $A\bar{v}_x t$. It amounts to
$\rho(A\bar{v}_x t)/2$, where the factor 1/2 comes from the fact that only half of the total
number density $\rho$ moves right. Elastic scattering will reverse the orthogonal
velocity component, as sketched in the figure. The change in velocity by the
scatter is thus $2\bar{v}_x$. Consequently, the force that the average molecule with
mass $m$ exerts is $\delta F = m\,2\bar{v}_x/\delta t$, where $\delta t$ is the average contact time during
which the force acts.

During that short time period, a fraction $\delta t/t$ of the molecules moving
right hits the wall. The total force per unit surface, the pressure, is thus
$p = \rho m \bar{v}_x^2$. The average velocity in each direction is given by the law of
equipartition, which states that every degree of freedom of the gas molecule
on average has the same energy, $E_x = kT/2$, where $T$ is the temperature
and $k$ is the Boltzmann constant, with its modern value of $1.380649 \times 10^{-23}$
J K$^{-1}$. The average squared orthogonal speed is thus $\bar{v}_x^2 = kT/m$, and the pressure
is $p = \rho kT$. The number density $\rho$ of the gas molecules is the number of
moles, $n$, times Avogadro’s number of molecules per mole, $N_A$, divided by the
container volume $V$. With this we obtain the more familiar form of the ideal
gas equation $pV = nN_AkT$. This equation contains no particular property
of the gas except that a mole consists of $N_A$ molecules, it is valid for any
ideal gas. The combination $R = N_Ak$ is thus rightly called the universal gas
constant.


Focus Box 4.1: Pressure in ideal gases 
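
The ideal gas equation of Focus Box 4.1 can be evaluated with the exact SI values of the two constants. The sketch below computes the universal gas constant and the volume of one mole of an ideal gas; the reference conditions (273.15 K and 101,325 Pa) and the function name are assumptions chosen for illustration.

```python
N_A = 6.02214076e23     # Avogadro's number in 1/mol (exact in the SI)
K_B = 1.380649e-23      # Boltzmann constant in J/K (exact in the SI)

R_GAS = N_A * K_B       # universal gas constant in J/(mol K)

def ideal_gas_volume(moles, temperature_k, pressure_pa):
    """Volume in m^3 from pV = n N_A k T."""
    return moles * N_A * K_B * temperature_k / pressure_pa

# Assumed reference conditions: 0 degrees Celsius and one standard atmosphere
molar_volume = ideal_gas_volume(1.0, 273.15, 101325.0)
print(R_GAS, molar_volume)
```

Under these conditions one mole occupies a little more than 22 litres, whatever the gas, which is Avogadro's hypothesis in quantitative form.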
unnoticed by physicists. Brown had observed random motion of pollen suspended
in water [13, 14]. Finally in 1863 Christian Wiener [29] provided a
kinetic theory of the phenomenon³. The pollen on the liquid surface are pushed
around by water molecules. For that to happen, they must be small enough so 
that impacts in different directions do not average out, and light enough for 
the small forces to cause visible displacements. Jean Perrin [135] quotes work 
by Father Carbonelle SJ from the late 1870s to summarise this approach: 


In the case of a surface having a certain area, the molecular 
collisions of the liquid which cause the pressure, would not pro- 
duce any perturbation of the suspended particles, because these, 
as a whole, urge the particles equally in all directions. But if the 
surface is of area less than is necessary to ensure the compensa- 
tion of irregularities, there is no longer any ground for considering 
the mean pressure; the inequal pressures, continually varying from 
place to place, must be recognised, as the law of large numbers no 
longer leads to uniformity; and the resultant will not now be zero 
but will change continually in intensity and direction. Further, the 
inequalities will become more and more apparent the smaller the 
body is supposed to be, and in consequence the oscillations will at 
the same time become more and more brisk ... 


For many major steps forward in science the parenthood is disputable. This 
may well be due to the fact that novel concepts do not just appear overnight, 
they are the product of circulating experimental results and theoretical ideas. 
This can create some kind of l’air du temps, an atmosphere in which progress 
is pushed forward and scientific consensus is formed [404]. In particular that 
seems to be the case for the discovery of periodicity in the sequence of chemical 
elements [403]. In the early 19th century, William Prout and Johann Wolfgang 
Döbereiner had noted that atomic weights appeared in rough multiples of 
the hydrogen mass. They formulated the “triad law”: elements with similar 
chemical characteristics formed groups of three, the triads, in which the atomic 
weight of the middle element is the arithmetic mean of the lighter and heavier 
one. Examples of triads are the halogens chlorine, bromine, and iodine or the 
alkali metals lithium, sodium and potassium. These sequences were found to 
extend beyond triads in the first half of the 19th century. 
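
The triad law is easy to test with modern atomic weights, which are assumptions here since the text quotes no values. The sketch below compares the atomic weight of the middle element of each triad with the arithmetic mean of its two neighbours; the function name is hypothetical.

```python
# Modern standard atomic weights (assumed values, rounded)
ATOMIC_WEIGHT = {
    "Cl": 35.45, "Br": 79.904, "I": 126.90,
    "Li": 6.94, "Na": 22.990, "K": 39.098,
}

def triad_deviation(light, middle, heavy):
    """Relative deviation of the middle weight from the mean of the outer two."""
    mean = 0.5 * (ATOMIC_WEIGHT[light] + ATOMIC_WEIGHT[heavy])
    return abs(mean - ATOMIC_WEIGHT[middle]) / ATOMIC_WEIGHT[middle]

halogen_deviation = triad_deviation("Cl", "Br", "I")
alkali_deviation = triad_deviation("Li", "Na", "K")
print(halogen_deviation, alkali_deviation)
```

Both triads satisfy the rule to within a few percent, the alkali metals considerably better than the halogens. The rule is thus approximate rather than exact.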

In 1863, the English chemist John Newlands published a classification of 
the 56 then established elements in 11 different groups in a series of letters 
to the journal Chemical News [28, 30, 31, 33]. As the chief chemist of a sugar 
factory, he had little academic standing. When he finally read a paper to 
the Chemical Society in 1866, it was not taken seriously. One of the atten- 
dants even asked “whether he had ever examined the elements according to 
the order of their initial letters” [34]. Newlands certainly contributed to the 


3For the history and theory of Brownian motion see: Edward Nelson, Dynamical Theories
of Brownian Motion, https://web.math.princeton.edu/~nelson/books/bmotion.pdf,
Princeton University Press (2001).
idea of the periodic system; whether he counts among the fathers of the system
itself is debatable. The breakthrough came in 1869 with the work of the Russian
chemist Dmitri Ivanovich Mendeleev [35], which was clearer and more
consistent than what had been published through the 1860s. Almost simulta- 
neously, the German Lothar Meyer published similar findings [36]. Both had 
come across the periodicity in chemical properties as a function of atomic 
weight while preparing textbooks on chemistry. On the basis of empty places 
in their tables, Mendeleev and Meyer predicted further elements that had 
yet to be discovered. Mendeleev also discussed their properties. In 1882, both 
Meyer and Mendeleev received the Davy Medal from the Royal Society in 
recognition of their work on the periodic law. Figure 4.1 shows the oldest 
surviving example of a classroom periodic table from 1885, kept at the Uni- 
versity of St. Andrews in Scotland and discovered in 2014 in a storage room 
of its School of Chemistry⁴. Today’s table counts 118 elements, with all up to
element 101, mendelevium, identified by chemical analysis. The discovery of 
plutonium Pu during research for nuclear weapons (see Chapter 6) started a 
race for the synthesis of man-made elements, which is still ongoing at major 
accelerator centres around the world, like LBNL in Berkeley, USA, JINR in 
Dubna, Russia, GSI in Darmstadt, Germany, and RIKEN in Wako, Japan. 

We do not want to jump ahead too far, but keep in mind that the chemical 
properties of elements are determined by the configuration of their electrons, 
thus by their nuclear charge, not their atomic mass. You will notice that an 
entire group is missing from the early table, the noble gases. Helium was
discovered by the astronomers Pierre Janssen and Joseph Norman Lockyer 
in 1868, identified by its characteristic spectral lines in the light from the 
Sun’s chromosphere, only visible during eclipses. Four others, argon, krypton, 
xenon and neon, were discovered by the Scottish physical chemist Sir William 
Ramsay in collaboration with John William Strutt, Lord Rayleigh, who also 
established that they formed an entirely new group of elements, because of 
their chemical inertness. Both received a Nobel prize in 1904 to honour their 
discovery, Ramsay for chemistry, Rayleigh for physics. 

So by the end of the 19th century, the atomic composition of matter was 
largely accepted. At that time, however, atoms were considered by many physi- 
cists to be purely hypothetical constructs, rather than real objects. An impor- 
tant turning point was Albert Einstein’s (1905) [119] and Marian von Smolu- 
chowski’s (1906) [124] papers on Brownian motion, which succeeded in making 
accurate quantitative predictions based on the kinetic theory. We sketch the 
argument and Jean Perrin’s verification of the result in Focus Box 4.2. 

Why was Brownian motion more effective in fostering atomism than other 
arguments? I think that the cause may well be the direct visual observation. 
When observing the jitter of small suspended particles through a microscope 
one has a direct impression of the random impacts of molecules. This is a valid 


4https://www.thesaint-online.com/2019/01/worlds-oldest-surviving-periodic-table-found-in-st-andrews/
Einstein treated the problem of Brownian motion as a diffusion process. He
showed that the mean quadratic displacement $\overline{x^2}$ (in one dimension) of a small
suspended particle of radius $r$, observed over a time period $\tau$, is:

$$\overline{x^2} = \frac{RT}{3\pi\eta r N_A}\,\tau$$

$R$ is the universal gas constant (see Focus Box 4.1), $T$ the temperature, $\eta$ the
viscosity of the liquid and $N_A$ Avogadro’s number of molecules per mole. The
relation can be used to determine Avogadro’s number in a way independent
of the ideal gas equation. A simple derivation in the Newtonian spirit of Focus
Box 2.7 was given by Paul Langevin [125]. A suspended particle with mass $m$
has the following one-dimensional equation of motion:

$$m\frac{d^2x}{dt^2} = F_x - f\frac{dx}{dt}$$

The thermal force $F_x$ in the $x$ direction works against the frictional force $fv_x$.
We multiply this equation by $x$ and obtain:

$$\frac{1}{2}m\frac{d}{dt}\left(\frac{dx^2}{dt}\right) - m\left(\frac{dx}{dt}\right)^2 = xF_x - \frac{1}{2}f\frac{dx^2}{dt}$$

When one takes the average of this equation over multiple observations of the
same particle during time $\tau$, the first term on the right averages to zero. The
average of $dx^2/dt$ is the mean quadratic deviation $\overline{x^2}$ per unit time, the second
term on the left is twice the mean kinetic energy $kT/2$:

$$\frac{1}{2}m\frac{d}{dt}\left(\frac{\overline{x^2}}{\tau}\right) - kT = -\frac{1}{2}f\left(\frac{\overline{x^2}}{\tau}\right)$$

This is a differential equation for $\overline{x^2}/\tau$, a quantity which grows over time to
reach an asymptotic value of:

$$\frac{\overline{x^2}}{\tau} = \frac{2kT}{f}$$

Replacing $f = 6\pi\eta r$ according to Stokes’ Law (see Focus Box 2.7) and $k =
R/N_A$, we obtain Einstein’s formula quoted above. The linear relation between
mean quadratic displacement and observation time was verified experimentally by Jean
Perrin in 1909 [135], using a camera lucida to project and measure the motion
of suspended particles. His experiment was repeated in 2006 by Ronald
Newburgh and collaborators [564] in a modern version using a CCD camera.
Some systematic dependence on the particle size is observed for the obtained
value of $N_A$, but the linear relationship is verified.


Focus Box 4.2: Brownian motion 
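
To get a feeling for the numbers in Focus Box 4.2, the sketch below evaluates Einstein's formula for a hypothetical particle of 0.5 µm radius in water at room temperature, observed for 30 s. It also computes the time constant $m/f$ with which the solution of Langevin's differential equation approaches its asymptotic value. The particle size, its density, the viscosity and the observation time are all assumptions chosen for illustration.

```python
import math

R_GAS = 8.314462618     # universal gas constant in J/(mol K)
N_A = 6.02214076e23     # Avogadro's number in 1/mol

def mean_square_displacement(tau, temperature, viscosity, radius):
    """Einstein's formula: R T tau / (3 pi eta r N_A), in m^2."""
    return R_GAS * temperature * tau / (3.0 * math.pi * viscosity * radius * N_A)

# Hypothetical experiment: 0.5 micrometre sphere in water at 20 degrees Celsius
radius = 0.5e-6         # m
viscosity = 1.0e-3      # Pa s, approximate value for water
temperature = 293.15    # K
tau = 30.0              # s

rms_displacement = math.sqrt(
    mean_square_displacement(tau, temperature, viscosity, radius))

# Time constant m/f of the approach to the asymptotic value,
# assuming a particle density of 1000 kg/m^3
mass = 4.0 / 3.0 * math.pi * radius ** 3 * 1000.0
friction = 6.0 * math.pi * viscosity * radius    # Stokes' law
relaxation_time = mass / friction
print(rms_displacement, relaxation_time)
```

The displacement is a few micrometres, large enough to follow under a microscope. The relaxation time is far below a microsecond, so on the time scale of any observation the asymptotic value is reached immediately.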
Figure 4.1 The oldest surviving classroom poster (according to the Guinness
Book of Records) of Mendeleev’s Periodic Table of Elements dated 1885, 
found at St Andrews University in Scotland in 2014. (Credit: University of 
St. Andrews Library Ms39012) 


answer to Ernst Mach’s famous (but undocumented) dictum to question the 
reality of atoms: “Have you seen one?” Or at least as close as one could get 
until the scanning tunnelling microscope (STM) was invented by Gerd Binnig 
and Heinrich Rohrer in 1981 [459].


4.2 ATOMIC SPECTRA 


Another fact known about atoms in the second half of the 19th century 
was that they radiate light when heated. The emitted light can be observed 
through a prism as a series of coloured lines with dark spaces in between. 
Each element produces a unique set of spectral lines. Likewise, atoms absorb 
ultraviolet, visible or infrared light only for the wavelengths they emit. This 
fact was established by the Swedish physicist Anders Jonas Ångström in the
1850s [22]. In the 1860s, Ångström studied the emission spectrum of the Sun
and identified more than 1000 spectral lines. Gustav Kirchhoff and Robert 
Bunsen [25], a physicist and a chemist, associated the wavelengths of lines to 
the elements that emitted them. It was thus found that four of the lines in the 
visible sunlight belong to the hydrogen spectrum [22], with wavelengths 410 
nm, 434 nm, 486 nm, and 656 nm. These data were used by the Swiss mathematical
physicist Johann Jakob Balmer to establish an empirical formula [46]
for the hydrogen emission wavelengths: $\lambda = Bn^2/(n^2 - 4)$ with $B = 364.5$ nm
and $n$ an integer number greater than 2. The formula was later generalised by
the Swede Johannes Rydberg [53] to all hydrogen-like ions, i.e. those which
have only one electron, like He$^+$, Li$^{2+}$, Be$^{3+}$ and B$^{4+}$:

$$\frac{1}{\lambda} = R_H Z^2 \left(\frac{1}{n_1^2} - \frac{1}{n_2^2}\right)$$

with the Rydberg constant $R_H = 4/B$, the nuclear charge number $Z$ and integers
$n_1 < n_2$. The unit of typical atomic size and typical length of crystal bonds,
1 Å = 100 pm = $10^{-10}$ m, was named after Ångström to honour this pioneer
of atomic spectroscopy. It is not an SI unit, but still regularly used e.g.
in crystallography and chemistry.
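
Balmer's formula can be checked directly against the four visible hydrogen lines quoted above. A minimal sketch, with hypothetical function and variable names:

```python
BALMER_B_NM = 364.5   # Balmer's constant B in nm

def balmer_wavelength_nm(n):
    """Balmer's empirical formula B n^2 / (n^2 - 4) for an integer n > 2."""
    if n <= 2:
        raise ValueError("n must be an integer greater than 2")
    return BALMER_B_NM * n ** 2 / (n ** 2 - 4)

# The four visible lines correspond to n = 3, 4, 5, 6
visible_lines = [balmer_wavelength_nm(n) for n in range(3, 7)]

# Rydberg constant from R_H = 4/B, converted from 1/nm to 1/m
rydberg_per_m = 4.0 / (BALMER_B_NM * 1.0e-9)
print([round(line) for line in visible_lines], rydberg_per_m)
```

Rounded to the nearest nanometre, the formula reproduces the wavelengths 656, 486, 434 and 410 nm, and the resulting Rydberg constant is close to $1.1 \times 10^7$ m$^{-1}$.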

In addition to the distinct lines, all elemental spectra also show bands 
at low wavelengths. And metals have especially rich spectra, with hundreds, 
sometimes thousands of narrowly spaced lines. The origin of lines and bands 
was not known, but atomists and especially J.J. Thomson believed that they 
had something to do with the electrons in matter.

In 1897, the Dutch physicist Pieter Zeeman observed a splitting of spectral lines
when a static external magnetic field is applied to the emitting substance [78, 
79]. Sometimes lines split into two, sometimes into three separate weaker 
lines [77]. There were many attempts to explain this phenomenon, none really 
succeeded before the advent of quantum mechanics and the discovery of spin 
(see Chapter 5). The same is true for the analogous line splitting in elec- 
tric fields, established by Johannes Stark [159] after initial observations by 
Woldemar Voigt [105]. 


4.3 ELECTRONS IN MATTER 


We noted in Chapter 2 that soon after Thomson discovered the electron in 
1897, he suggested that it was an ingredient of all matter. He conjectured 
this, since cathode rays were emitted regardless of the cathode material or 
the nature of the rest gas in his tubes. The convincing proof of this fact, 
however, had to wait for the study of direct interactions between light and 
atomic electrons, the photoelectric effect⁵. During his experiments on
electromagnetic waves, Heinrich Hertz had noticed that the intensity of sparks
in his spark gap emitter (see Figure 2.4) was weaker when it was in the dark
and stronger when illuminated [49]. His assistant Wilhelm Hallwachs observed
that focussing ultraviolet light on a zinc plate connected to a battery caused
a current to flow [52]. Using differently pre-charged plates, he concluded that
this was due to negative charges being emitted from the plate. J.J. Thomson
showed in 1899 that these were the same as the “corpuscles” he had
discovered [91]. Philipp Lenard followed up on these experiments using a
powerful arc lamp to generate light of variable intensity. Using a set-up
sketched in Focus Box 4.3, he measured the energy and intensity of the emitted
electrons [106], even though he believed that electrons were immaterial. He
found the astonishing fact that the electron energy had a fixed minimum,
independent of the light intensity. He wrongly concluded that the electron
energy was already present when they were bound in matter and that light only
triggered their emission [432]. This stayed the generally preferred assumption
until Millikan, using an arc lamp strong enough to allow spectral separation,
established in 1914 that the electron kinetic energy increased linearly with
the frequency of the light and used this to measure Planck’s
constant [157, 171] (see Chapter 5).

⁵ The history of photoelectric emission is discussed in detail in [476].

It was generally assumed at the time that if granular matter contained 
electrons, there would be a great many per atom. This guess was based on 
the small ratio of electron mass to atomic mass on one hand, and the rich 
atomic spectra on the other hand. The emission of heated substances displayed 
many spectral lines, which were assumed to come from the eigenfrequencies 
of electrons in atoms. For this to work, there would have to be hundreds per 
atom. A first atomic model inspired by astronomy was published in 1903 by 
Hantaro Nagaoka [115], a pioneer of modern Japanese physics in the Meiji 
era. His model was inspired by Maxwell’s essay on the stability of Saturnian 
Rings [23], which won the Adams prize of the University of Cambridge for the 
year 1856. Maxwell had examined the possibility that the rings were solid, 
liquid or fragmented in nature and had concluded that mechanical stability 
of their motion required them to consist of many small fragments circulating 
Saturn independently. There was thus a tempting analogy to atomic models 
with a large positive charge in the centre surrounded by a densely populated 
electron ring, generically called Saturnian models. Except that electrons of 
course repel each other, creating a mechanical instability in the system, given 
the 1/r² distance law for the Coulomb force. One can in principle balance the
attractive central force by the repelling force among the ring particles. But 
the slightest deviation of one electron in the radial direction would make the 
ring collapse or explode. The latter event was conjectured to be the cause of 
radioactivity. 

Clearly, electrons in the ring would have to be constantly accelerated by 
the central force to stay on their orbit. They would thus also lose energy 
due to electromagnetic radiation like all accelerated charges. Eventually they 
would fall into the positive central charge. However, for a very large number 
of electrons, this radiative instability is not very important. While individ- 
ual circulating charges radiate, a continuous circular DC current does not. It 
was thus the mechanical instability, not the radiative one that was the main 
argument against Saturnian models. 

In 1904, J.J. Thomson proposed a different model, in which
positive charge and electrons would not be separated, but inhabit the atomic 
volume together. That would turn the 1/r² distance law into a force that grows linearly with r, since only the
positive charge within the electron ring radius would act according to Gauss’ 
law. The title of his paper [116] resembles a modern restaurant menu, where 
The experimental set-up used by Hertz, Hallwachs, Thomson, and Lenard to look
into the photoelectric effect is sketched here. It consists of a discharge
tube with a quartz window, which lets UV light from an arc lamp pass and hit
the metallic cathode. This causes a current to flow if the energy of electrons
liberated by the light is sufficient to reach the anode. The voltage between
cathode and anode can be regulated to be positive as well as negative.

If the anode charge is negative, it will repel the electrons. At the negative
stopping potential V₀, the current A will stop. In that case, the potential
difference between cathode and anode times the electron charge, eV₀, is just
equal to the kinetic energy E_kin of the electrons liberated from the cathode.
For positive voltages, the photocurrent quickly saturates.


To the surprise of many, when the light intensity I is increased, the
saturation current increases, but the stopping voltage does not change, as
schematically shown by the VA-graph on the left. The electron kinetic energy
is thus independent of the light intensity.

The stopping voltage only changes when varying the frequency ν of the light.
It is in fact proportional to that frequency, or inversely proportional to the
wavelength. The shorter the wavelength, the higher the kinetic energy of the
liberated electrons. There is also a maximum wavelength above which no current
flows irrespective of the voltage.


This cannot be understood by classical electrodynamics. There the energy 
of an electromagnetic wave is proportional to its intensity, thus the opposite 
should happen: the kinetic energy of kicked-out electrons should be propor- 
tional to the light intensity, the wavelength should have no influence. To under- 
stand the phenomenon, one needs to treat the beam of light as a collection of 
particles, photons, each with its own fixed energy (see Section 5.2). 
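
The behaviour described in this box can be illustrated with a small numerical sketch. It assumes the photon picture of Section 5.2, with a photon energy hc/λ and a material-dependent minimum energy needed to free an electron. That threshold, its value of 4.3 eV (roughly appropriate for a zinc cathode) and all function names are assumptions made for illustration and are not taken from the text.

```python
HC_EV_NM = 1239.84  # h*c in eV*nm, rounded standard value


def photon_energy_ev(wavelength_nm: float) -> float:
    """Energy of one photon in eV for a wavelength given in nm."""
    return HC_EV_NM / wavelength_nm


def stopping_voltage(wavelength_nm: float, threshold_ev: float) -> float:
    """Stopping voltage V0 in volts, from e*V0 = E_kin.

    threshold_ev is the minimum photon energy needed to liberate an
    electron (an illustrative material constant, not given in the text).
    Returns 0.0 above the maximum wavelength, where no current flows.
    """
    e_kin_ev = photon_energy_ev(wavelength_nm) - threshold_ev
    return max(e_kin_ev, 0.0)


def max_wavelength_nm(threshold_ev: float) -> float:
    """Longest wavelength that can still liberate electrons."""
    return HC_EV_NM / threshold_ev


if __name__ == "__main__":
    THRESHOLD_EV = 4.3  # hypothetical value, roughly that of a zinc cathode
    for wavelength in (200.0, 250.0, 300.0):
        v0 = stopping_voltage(wavelength, THRESHOLD_EV)
        print(f"{wavelength:.0f} nm -> V0 = {v0:.2f} V")
    print(f"maximum wavelength: {max_wavelength_nm(THRESHOLD_EV):.1f} nm")
```

The light intensity does not appear anywhere in these functions: in this picture it only sets the number of photons, hence the saturation current, not the stopping voltage.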


Focus Box 4.3: Photoelectric emission 
the name of the dish contains the whole recipe: “On the structure of the atom:
an investigation of the stability and periods of oscillation of a number of
corpuscles arranged at equal intervals around the circumference of a circle; with
applications of the result to the theory of atomic structure.” Thomson’s model
does not at all deserve the nickname “plum pudding model”, which suggests
a static scatter of electrons in a sticky, positively charged dough. Instead, his 
model consisted of an orderly distribution of corpuscles on rings, dynamically 
moving without friction inside a sphere of homogeneously distributed positive 
charge. The electron distribution was inspired by experiments of the American
physicist Alfred M. Mayer in the late 1870s [43]. Mayer had studied the
distribution of magnetic needles, made to float upright in water, under the
influence of a strong magnet held above. He found that the needles arranged
themselves in rings keeping regular distances, and suggested that this finding
could have a bearing on atomic models. Thomson systematically studied the
stability of electrons arranged in a ring as a function of their number⁶. He
found that large numbers of electrons can only be stably accommodated in 
several concentric rings [116]: 


We have thus in the first place a sphere of uniform positive elec- 
trification, and inside this sphere a number of corpuscles arranged 
in a series of parallel rings, the number of corpuscles in a ring vary- 
ing from ring to ring: each corpuscle is travelling at a high speed 
round the circumference of the ring in which it is situated and the 
rings are so arranged that those which contain a large number of 
corpuscles are near the surface of the sphere while those in which 
there are a smaller number of corpuscles are more in the inside. 


Thomson’s model, even though only calculable in a two-dimensional 
approximation, was the reference atomic model in the first decade of the 20th 
century. Max Born chose it as the subject of his 1909 habilitation lecture 
in Göttingen. The mechanical instability of Saturnian models was overcome, 
only the radiative one remained; it was taken as the cause of radioactivity. But 
the nature of the positively charged atomic substance remained mysterious. 
In any case, despite many efforts the Thomson model failed to explain atomic 
spectra. 


4.4 NUCLEI 


The nature of the positive charge in atoms was brought to light by Ernest 
Rutherford and his colleagues Hans Geiger and Ernest Marsden at the Physi- 
cal Laboratories of the University of Manchester. After undergraduate studies 
in his native New Zealand, Rutherford joined Thomson’s Cavendish labora- 
tory in Cambridge and soon became its rising star. In 1898, he was nominated 
professor at McGill University in Montreal, Canada, where he started research 


⁶ A detailed translation of Thomson’s calculation into modern terms is found in [595].
Rutherford and Soddy understood in 1903 that radioactive decay is a random
process governed by the laws of probability [109]. The probability that a given
atom (or rather nucleus, as we know today) decays in the lapse of time between
t and t + dt is a constant independent of t. That means that the nucleus has no
memory, it does not “age”. This fact is also valid e.g. for lotteries: the fact that
a certain number has not been drawn for a number of weeks does not mean that
it will be more likely to be drawn next week. In a well-implemented lottery,
the probability for all numbers to be drawn is constant, and (hopefully!) the
same.

All nuclei decay independently of each other. Thus out of a number N(t) that
exists at time t, a constant fraction dN/N will decay in the lapse of time dt:

$$\frac{dN}{N} = -\lambda\,dt \quad ; \quad N(t) = N(0)\,e^{-\lambda t}$$


The number of intact nuclei decreases exponentially with time, the sample
ages, the atoms in it do not. The constant λ is called the decay constant or
decay rate. It is a characteristic of each radioactive nucleus, its inverse is called
the lifetime τ = 1/λ of the nucleus. Another often used measure is the half-life
t₁/₂ = τ ln 2, the time at which half the nuclei have decayed.

If nuclei were not independent of each other, terms inversely proportional to
N(N − 1), the number of pairs, or N(N − 1)(N − 2), the number of triplets,
etc. would have to be taken into account. If radioactivity were not a random
process, the decaying fraction would not be constant. In a recent long term
test, Dobrivoje Novković and collaborators have measured the decay rate of
¹⁹⁸Au, a radioactive isotope of gold, over more than two months [560], close to
twenty times its lifetime. They found no deviation from the exponential decay
law and λ to be constant within the statistical errors of their measurement.
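
As a minimal sketch of the decay law, the following lines convert between half-life and lifetime and evaluate the surviving fraction of a sample. The half-life of about 2.69 days used for ¹⁹⁸Au is an approximate value assumed here, not quoted in the text, and the function names are illustrative.

```python
import math


def lifetime_from_half_life(half_life: float) -> float:
    """Lifetime tau = t_half / ln 2, in the same time unit as the input."""
    return half_life / math.log(2)


def surviving_fraction(t: float, lifetime: float) -> float:
    """Fraction N(t)/N(0) = exp(-t / tau) of nuclei still intact at time t."""
    return math.exp(-t / lifetime)


if __name__ == "__main__":
    HALF_LIFE_DAYS = 2.69  # assumed approximate half-life of 198Au
    tau = lifetime_from_half_life(HALF_LIFE_DAYS)
    print(f"lifetime: {tau:.2f} days")
    for n_lifetimes in (1, 5, 20):
        fraction = surviving_fraction(n_lifetimes * tau, tau)
        print(f"after {n_lifetimes:2d} lifetimes: {fraction:.3e} of the sample left")
```

The absence of memory shows up as a product rule: the fraction surviving a time t₁ + t₂ equals the fraction surviving t₁ times the fraction surviving t₂, whatever the starting time.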


Focus Box 4.4: Radioactive decay law 
on radioactivity together with the radiochemist Frederick Soddy. They
discovered several radioactive elements, including radium, and established the
law that describes the timing properties of radioactive decay [109] (see Focus 
Box 4.4). 

In 1907, Rutherford moved to the Victoria University of Manchester. He 
and Thomas Royds proved in 1909 that alpha particles are doubly ionised 
helium [129]. In parallel, Hans Geiger and Ernest Marsden started investi- 
gating the scattering of these particles by matter. They found that while the 
scattering angles were small in general, a tiny fraction of the alpha parti- 
cles, of the order of one in several thousand, was scattered by large angles, 
even reflected backwards [127]. This would be inconceivable if the positive charge in
an atom were smeared out over the atomic volume. The qualitative observa- 
tion thus triggered Ernest Rutherford to propose a configuration of the atom 
where the positive charge is concentrated in a small nucleus at the centre of 
the atom [140], surrounded by only as many electrons as necessary to balance 
the central charge. The negative charge density of the electrons, small since 
they are few and take up the whole atomic volume, does not contribute to 
large angle scattering. He thus calculated the rate of scatters off the com- 
pact nucleus alone using classical electrodynamics. The results predicted the 
following features (see Focus Box 4.5): 


• The number of deflected α particles per unit time, the scattering rate, is
proportional to 1/sin⁴(θ/2), where θ is the scattering angle. The rate of
scatters thus decreases very rapidly with angle, but is still appreciable
for angles larger than 90°.


• The rate is proportional to the thickness of scattering material for small
thicknesses.


• It is proportional to the square of the central positive charge of the
target atom.


• It is proportional to the inverse fourth power of the velocity of the
incident α particles, i.e. inversely proportional to the square of their kinetic
energy.
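
The relative dependences listed above can be tried out in a few lines. The sketch below returns a rate in arbitrary units, proportional to Z²/(E² sin⁴(θ/2)). The overall normalisation is deliberately left out, the thickness dependence is omitted, and the default values (gold, 5 MeV) and function name are illustrative choices.

```python
import math


def relative_rate(theta_deg: float, charge_z: int = 79,
                  kinetic_energy_mev: float = 5.0) -> float:
    """Scattering rate in arbitrary units: Z^2 / (E^2 * sin^4(theta/2))."""
    half_angle = math.radians(theta_deg) / 2.0
    return charge_z**2 / (kinetic_energy_mev**2 * math.sin(half_angle) ** 4)


if __name__ == "__main__":
    reference = relative_rate(150.0)
    for theta in (5.0, 30.0, 90.0, 150.0):
        ratio = relative_rate(theta) / reference
        print(f"theta = {theta:5.1f} deg: rate relative to 150 deg = {ratio:.3g}")
```

The rate drops by more than five orders of magnitude between 5° and 150°, which gives an idea of the dynamic range an experiment has to cover to test the angular dependence.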


Guided by Rutherford’s predictions, Geiger and Marsden perfected their
experimental set-up (see Focus Box 4.6) and systematically tested these
predictions. They measured the scattering rate as a function of angle for various
target materials and kinetic energies of the α projectiles. Their quantitative
experimental results [150] confirmed all of the above predictions concerning
relative scattering rates. They were even able to obtain a rough result on the
absolute scattering rate and established that the charge of the nucleus is about
half the atomic weight of the target. Their results clearly demonstrated that
the positive nucleus is much smaller than the atom. The 5 MeV α particles
approached the atomic centre to about 45 fm in a head-on collision, whereas a
gold atom is about 3000 times larger, 135 pm. The size of the gold nucleus is only
We calculate the fraction dI/I of the initial intensity I of projectiles with
charge ze, scattered off a target which consists of nuclei with charge Ze.
Classically, all particles which pass by with an impact parameter between b
and b + db are scattered by the same angle. The scattering angle θ depends
only on this impact parameter, the transverse distance between projectile
and target.

The target material is characterised by its density of atoms, ρ, and the foil
thickness, Δx, which has to be small enough so that multiple scatters can be
neglected. The fraction of the beam intensity which comes by a target particle
is then ρΔx. The scattered fraction is thus dI/I = ρΔx (2πb db).

How does the scattering angle depend on b? We consider elastic scattering
off a static target, the kinetic energy of the projectile is thus conserved and
$|\vec{v}_i| = |\vec{v}_f| = v_0$. Newton’s equation yields:

$$m\frac{d\vec{v}}{dt} = m\frac{d\vec{v}}{d\varphi}\frac{d\varphi}{dt} = \frac{(ze)(Ze)}{4\pi\epsilon_0 r^2}\,\hat{r} \quad ; \quad \frac{d\vec{v}}{d\varphi} = \frac{zZe^2\,\hat{r}}{4\pi\epsilon_0 m r^2 (d\varphi/dt)}$$

The angular velocity dφ/dt can be deduced from the equally conserved
angular momentum L:

$$L = m r^2 \frac{d\varphi}{dt} = m v_0 b \quad ; \quad \frac{d\vec{v}}{d\varphi} = \frac{zZe^2}{4\pi\epsilon_0 L}\,\hat{r}$$

Integrating from initial to final state we have:

$$\int_{\vec{v}_i}^{\vec{v}_f} d\vec{v} = \vec{v}_f - \vec{v}_i = \frac{zZe^2}{4\pi\epsilon_0 L} \int_0^{\pi-\theta} \hat{r}\,d\varphi$$

The length of the vector on the left is $|\vec{v}_f - \vec{v}_i| = 2 v_0 \sin(\theta/2)$. The integral
over dφ gives a vector of length 2 cos(θ/2), also in the direction $(\vec{v}_f - \vec{v}_i)$.
Substituting this result we obtain the relation between the impact parameter
b and the scattering angle θ:

$$b = \frac{zZe^2}{4\pi\epsilon_0 m v_0^2}\,\cot\frac{\theta}{2}$$

The scattered fraction of the beam intensity is thus:

$$\frac{dI}{d\Omega} = I \rho \Delta x \left( \frac{zZe^2}{8\pi\epsilon_0 m v_0^2} \right)^2 \frac{1}{\sin^4(\theta/2)}$$

where dΩ = sin θ dθ dφ is the solid angle subtended by the observer.
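
For readers who want the intermediate step between the impact parameter relation and the final rate, here is one way to fill it in. The shorthand $a$ for the prefactor of $\cot(\theta/2)$ is introduced here for convenience only, and $\rho$, $\Delta x$ denote the atom density and foil thickness of the target. The sketch assumes the relation $b = a \cot(\theta/2)$ derived in this box.

```latex
\begin{align*}
  b &= a \cot\frac{\theta}{2}, \qquad a = \frac{zZe^2}{4\pi\epsilon_0 m v_0^2}
  \quad\Rightarrow\quad
  |db| = \frac{a}{2\sin^2(\theta/2)}\, d\theta \\
  \frac{dI}{I} &= \rho\,\Delta x\; 2\pi b\, |db|
  = \rho\,\Delta x\; \pi a^2\, \frac{\cos(\theta/2)}{\sin^3(\theta/2)}\, d\theta \\
  d\Omega &= 2\pi \sin\theta\, d\theta
  = 4\pi \sin\frac{\theta}{2}\cos\frac{\theta}{2}\, d\theta
  \qquad \text{(integrated over the azimuth)} \\
  \frac{1}{I}\frac{dI}{d\Omega} &= \rho\,\Delta x\, \left(\frac{a}{2}\right)^2 \frac{1}{\sin^4(\theta/2)}
\end{align*}
```

With $a/2 = zZe^2/(8\pi\epsilon_0 m v_0^2)$ this is the angular dependence quoted in the main text, including the proportionality to $Z^2$ and to the inverse square of the kinetic energy.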


Focus Box 4.5: Classical electromagnetic scattering off a heavy nucleus 
about 7 fm, so the projectiles did not touch it, and the Coulomb force is the
only one acting. 
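
The distance of closest approach quoted above follows from energy conservation in a head-on collision: the projectile stops where its kinetic energy equals the Coulomb potential energy, d = zZe²/(4πε₀E). A minimal check, using the rounded standard value e²/(4πε₀) ≈ 1.44 MeV fm, which is an input assumed here and not given in the text:

```python
COULOMB_MEV_FM = 1.44  # e^2 / (4 pi epsilon_0) in MeV*fm, rounded standard value


def closest_approach_fm(z_projectile: int, z_target: int,
                        kinetic_energy_mev: float) -> float:
    """Head-on distance of closest approach in fm for a fixed point-like target."""
    return z_projectile * z_target * COULOMB_MEV_FM / kinetic_energy_mev


if __name__ == "__main__":
    d_fm = closest_approach_fm(2, 79, 5.0)  # alpha particle (z = 2) on gold (Z = 79)
    gold_atom_fm = 135.0e3                  # 135 pm expressed in fm
    print(f"closest approach: {d_fm:.1f} fm")
    print(f"atom size / closest approach: {gold_atom_fm / d_fm:.0f}")
```

The result stays well above the nuclear size of about 7 fm, consistent with the statement that only the Coulomb force acts.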

So far so good. Matter consists of atoms. Electrons somehow surround 
their compact nucleus without mechanical instability or radiative collapse.
But how do they manage to do that? And how do the positive charges in the 
nucleus clump together in a minute volume, vanquishing Coulomb repulsion? 
Classical physics has no answer to these questions. But quantum mechanics 
does. 
Credit: Taylor & Francis [150]


The Geiger and Marsden set-up shown above in a cut [150] and top view was 
enclosed in a cylindrical metal box B, sealed by the top cover P. It contained 
the source of α particles R, the scattering foil F, and a microscope M to which
the zinc sulphide screen S was rigidly attached. The box was fastened down
to a graduated circular platform A, which could be rotated by means of the 
airtight joint C. By rotating the platform, the box and microscope moved with 
it, while the scattering foil and radiating source, attached to the tube T and 
fastened to the stand L, remained in position. The box could be evacuated 
through the tube. The radiation from source R could be attenuated by the 
diaphragm D, both in kinetic energy and in intensity. 

Geiger and Marsden used several radioactive substances made available by 
Rutherford to measure the scattering rate off materials ranging from carbon 
to gold. Most of their measurements were made with “radium emanation” as 
a source, i.e. radon, ²²²Rn, and its decay product polonium, ²¹⁸Po. Both emit
α particles with a kinetic energy of about 5 MeV.

The number of impacts per unit time was counted on the phosphorescent
screen, which emitted a flash of visible light when hit by an alpha particle. This
could be reliably done by eye for impact rates between 5 and 90 per minute.
For small scattering angles, the microscope was thus pulled further out to
decrease the solid angle dΩ, for large angles it was pushed closer to the target
to increase it. Also, observations at small angles were done when the sources
had weakened according to the decay law. The measurements obviously were
corrected for these variations in solid angle and source strength. The rates
were also corrected for stray β rays from the source, measured in absence of
a foil.

The scattering rates were measured for angles between 5° and 150°, and target 
materials varying in thickness by a factor of 9 and in material from carbon 
(Z = 6) to gold (Z = 79). The kinetic energy of the α projectiles was degraded
by inserting mica foils between source and target. All of Rutherford’s predic- 
tions for the relative variation of the scattering rate as a function of these 
parameters were verified. 


Focus Box 4.6: The experiment of Geiger and Marsden 
The beauty and clearness of the dynamical theory, which asserts
heat and light to be modes of motion, is at present obscured by two 
clouds. I. The first came into existence with the undulatory theory of 
light ...; it involved the question, how could the earth move through an 
elastic solid, such as essentially is the luminiferous ether? II. The second 
is the Maxwell-Boltzmann doctrine regarding the partition of energy. 


Lord Kelvin, Nineteenth Century Clouds over the Dynamical Theory of 
Heat and Light, 1901 [103] 


In this chapter we see how quantum mechanics kicks in when matter
and forces are described at small distances. How perspective is changed
by introducing quantisation of energy. How quanta move through space-time
in the form of granular waves. What the difficulties were and still are in
interpreting quantum mechanics.

There is no other way to introduce this subject than by considerations
about heat and radiation. Thermodynamics is not exactly my favourite subject.
The reason may be the proliferation of macroscopic notions, like entropy,
enthalpy and other ‘natural’ state variables, which seem to defy reductionism.
Where others see the emergence of novel features, when a system goes from
few to many ingredients (like e.g. Ilya Prigogine [498] and his followers), I
only see statistics at work. But for the same reason, thermodynamics had a
crucial role in the development of quantum mechanics. At the beginning of
the 20th century, the Right Hon. Lord Kelvin (William Thomson, not to be
confused with J.J.) identified two “clouds” hanging over the kinetic theory of
heat, named in the initial quote of this chapter. The first “cloud” was
dealt with by Einstein’s relativity theory in 1905 (see Chapter 3). His second
“cloud” was dealt with by Max Planck and Einstein in the same period.
It refers to the Maxwell-Boltzmann distribution of the energy of molecules in
an ideal gas in thermal equilibrium (see Focus Box 5.1). It turns out that their
distribution law is one of a class of laws describing the energy distribution in
thermal equilibrium. And what Lord Kelvin really meant is the challenge to
understand why and how that equilibrium is reached.

5.1 HEAT


The main findings of 19th century thermodynamics can be summarised in 
three fundamental laws, which describe thermal systems in equilibrium: 

The first law is an application of the conservation of energy, well known 
from the laws of motion (see Focus Box 2.2). It is based on the idea that as 
heat produces work (like in a steam engine), so should work be able to produce 
heat. The idea that heat and work are both forms of energy was proposed by 
Julius Robert von Mayer [15] in 1842. Independently James Prescott Joule [16] 
published his findings about heat produced by mechanical and electromagnetic 
work in 1843. In 1850 both published a numerical value for the mechanical 
heat equivalent, with the more reliable number coming from Joule [18]. There 
was a lengthy fight for precedence of these findings, but that does not need 
to concern us here. Joule’s name was chosen for the energy unit of the SI, 
J = N m = kg m²/s².

What does concern us is the fact that conservation of energy requires that 
one can increase the internal energy U of a thermal system in two ways: by 
putting an amount Q of heat into it, or by applying an amount W of work. 
Conversely, the internal energy decreases, when heat or work is extracted from 
the system. In summary: ΔU = Q − W, where the sign of W indicates that work
is done by the system, which is normally what one wants. 

The second law is concerned with the way that thermal equilibrium is 
reached. Imagine two thermal systems both in equilibrium, but not at the 
same temperature. When they are brought into contact, heat will naturally 
be transferred from the hot one to the colder one, not vice versa. Heat transfer 
will continue until both reach the same temperature. The temperatures of the 
two systems do not drift apart, they drift towards a common one. The contact
of the two systems may involve mixing, i.e. exchange of material, or just 
thermal contact. The law thus formalises the fact that thermal systems tend 
towards spatial homogeneity of matter, energy and temperature. The pilot 
of this irreversible action was called entropy by the German physicist Rudolf 
Clausius. 

In the 1850s and 1860s, he argued that a change occurs not only in the 
system that is worked on, but also in the working engine. There is an inherent 
loss of “usable” heat when work is done [17]. Clausius described entropy as 
the dissipative energy use of a thermodynamic system or working body during 
a change of state. This is summarised in the fundamental thermodynamic 
relation: 


dU = T dS − p dV


It relates the change of internal energy U to temperature T and change of 
entropy S, in addition to the mechanical work, —pdV, done by expanding 
the volume V at constant pressure p, e.g. by moving a piston. The minus 
sign indicates that the system is providing work, not receiving it. What 
entropy really is was finally found by Ludwig Boltzmann in the 1870s [63]. He
interpreted entropy in the framework of statistical thermodynamics. If there
are N different ways to form a system’s state from microscopic ingredients, 
then S = k log N is the entropy of the state, where k is the Boltzmann
constant we met earlier (see Focus Box 4.1). The number of realisations N can
be seen as a probability of the macroscopic state: the more ways there are to 
realise it, the more likely it becomes. That entropy rises in systems which are 
left to evolve freely is thus simply due to the fact that they evolve from less 
probable to more probable states. In isolated systems the change of entropy 
between initial and final state is thus always positive and the evolution is irre- 
versible. If the system is not isolated, then the sum of the entropies, system 
plus surroundings, increases. 
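
Boltzmann's counting argument can be made concrete with a toy model that is not taken from the text: n distinguishable particles in a box divided into two equal halves. The number of ways to put n_left particles into the left half is a binomial coefficient, and S = k log N follows directly. All names below are illustrative.

```python
import math

K_B = 1.380649e-23  # Boltzmann constant in J/K (exact SI value)


def multiplicity(n_total: int, n_left: int) -> int:
    """Number of ways to choose which n_left of n_total particles sit on the left."""
    return math.comb(n_total, n_left)


def entropy(n_total: int, n_left: int) -> float:
    """Boltzmann entropy S = k log N of the macrostate, in J/K."""
    return K_B * math.log(multiplicity(n_total, n_left))


if __name__ == "__main__":
    n = 100
    for n_left in (0, 10, 30, 50):
        print(f"{n_left:3d} of {n} on the left: "
              f"N = {multiplicity(n, n_left):.3e}, S = {entropy(n, n_left):.3e} J/K")
```

The evenly mixed state has by far the largest number of realisations, so a system that starts with all particles on one side drifts towards it simply because that is the most probable macrostate.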

The third law of thermodynamics finally takes care of the temperature 
scale. If the temperature of the macroscopic system is proportional to the 
average kinetic energy of its microscopic components, then there must be a 
minimum to temperature, T = 0 in temperature units of Kelvin. However, this 
also means lim_{T→0} S = 0, such that this temperature can be approached, but
not actually reached. The third law was formulated by the physical chemist
Walther Nernst in 1905.

In a classical ideal gas with massive particles, the energy of the particles 
is distributed according to a Maxwell-Boltzmann distribution. Equilibrium is 
established by the particles themselves: they collide with each other, exchang- 
ing energy and momentum in the process. Maxwell [26, 27] had based his 
derivation of the energy distribution on a Newtonian approach to the kinetic 
theory, as we did in Focus Box 4.1. Ludwig Boltzmann [37, 42] gave it firm 
grounds in statistical thermodynamics. The Maxwell-Boltzmann distribution 
is an approximation for more general ones in the case of high temperatures 
and low densities (see Focus Box 5.1). 

Another member of the family of equilibrium distributions is Planck’s law 
of black body radiation, a special case for massless bosons which do not inter- 
act with each other at all. Its derivation by Planck, and its connection by 
Einstein to the properties of atoms, triggered the quantum revolution of the 
first decade of the 20th century. 

A black body is an ideal substance which emits and absorbs all light, 
i.e. electromagnetic radiation in all frequencies. The corresponding radiation 
is indeed a perfect example of a thermodynamic system. First experimen- 
tal approaches to measuring the energy distribution of this radiation, in the 
infrared for realistic temperatures, used roughened or blackened metal surfaces 
to realise a black body at least approximately. It was then realised that any 
enclosure of fixed temperature, which was impenetrable to electromagnetic 
radiation, contains a black body spectrum of light. Gustav Kirchhoff wrote 
in 1860 [24]: “When a volume is enclosed in bodies of equal temperature and 
when radiation cannot penetrate these bodies, then every bundle of rays in 
this volume has the same quality and intensity, as if it were emitted by a black 
body of the same temperature, it is thus independent of the nature and shape 
of the bodies and only depends on the temperature.” In modern language, 
this statement means that black body radiation is a gas of photons. Since
photons do not interact with each other for lack of electric charge, equilib- 
rium is established by interaction with the walls of the cavity, the interaction 
between photons and matter. The temperature of the walls thus determines 
the temperature of the photon gas. 

Measurements of the spectrum of black body radiation [520] require three 
ingredients: a black body constructed according to this recipe; a detector able 
to measure the intensity of radiation in a wide range of frequencies; and an 
analysing device capable of separating frequencies of light in the infrared. All 
three were developed in the 1890s at the Physikalisch-Technische Reichsanstalt 
(PTR) in Berlin¹, a research institute without a teaching mission founded on
the initiative of Werner von Siemens and Hermann von Helmholtz, which 
combined free fundamental research with services for industry. The industry 
tycoon von Siemens ceded private land in Berlin-Charlottenburg to the Reich- 
sanstalt, Helmholtz became its first president. The motivation for research on 
black body radiation was the search for a reliable device to gauge light sources, 
a standard candle, driven by the increasing competition between gas light and 
electric light. 

A black body cavity was successfully constructed by Otto Lummer, Ferdi- 
nand Kurlbaum and Ernst Pringsheim at PTR in the late 1890s. It consisted 
of an electrically heated platinum cylinder, blackened on the inside², enclosed
in a second insulating cylinder. Radiation could leave the black body through 
diaphragms at the end of the cylinder. 

A detector for a wide spectrum of light, the bolometer, was invented in 
1878 by the American astronomer Samuel P. Langley. It is essentially a very 
sensitive calorimeter (see Section 8.3). The sensitive part consists of a thin 
strip of material with low heat capacity like platinum, such that small radia- 
tion intensities warm it up as much as possible. Its temperature is measured 
through its thermal resistance, compensated with a Wheatstone bridge. At 
PTR, Lummer and Kurlbaum [55] refined the bolometer such that it had an 
impressive resolution and stability, capable of measuring temperature
differences of 10⁻⁷ degrees in a short exposure.

Spectroscopic separation of wavelengths was achieved with specialised 
prisms and lenses made of halite, fluorite and sylvine, which transmit light 
efficiently up to far infrared wavelengths. Combined with a bolometer with a 
narrow sensitive strip, one thus obtains a so-called spectrobolometer, apt to 
make precision measurements of the black body spectrum up to high temper- 
atures. 

In 1893, Wilhelm Wien derived a first law for the spectral radiance of a 
black body [57], i.e. the power per emitting area, solid angle and frequency 


¹ See https://www.ptb.de/cms/en/about-us-careers/about-us/history-of-ptr-and-ptb.html

² Black surfaces were at the time obtained by using carbon pigments, typically lamp black
produced by collecting soot from oil lamps. Modern technology based on carbon nanotubes 
obtains a record absorptivity of 99.995% from ultraviolet to terahertz frequencies [665]. 
range; it is quoted in Focus Box 5.1. From thermodynamical considerations
he concluded that the wavelength at the maximum of the spectrum was
inversely proportional to the temperature. This relation is called Wien’s
displacement law.
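
In its usual modern form the displacement law reads λ_max T = b, so the peak of the spectrum moves to shorter wavelengths as the temperature rises. The sketch below evaluates it for temperatures in the range of the PTR measurements. The numerical value of the constant b is the rounded modern value, assumed here and not quoted in the text.

```python
WIEN_B_UM_K = 2897.77  # Wien displacement constant in micrometre*kelvin (rounded)


def peak_wavelength_um(temperature_k: float) -> float:
    """Wavelength in micrometres at which the black body spectrum peaks."""
    if temperature_k <= 0:
        raise ValueError("temperature must be positive")
    return WIEN_B_UM_K / temperature_k


if __name__ == "__main__":
    for temperature in (800.0, 1400.0, 1650.0):
        print(f"T = {temperature:6.0f} K: lambda_max = "
              f"{peak_wavelength_um(temperature):.2f} micrometres")
```

For all these temperatures the maximum lies in the infrared, which is why infrared-transparent prisms and sensitive bolometers were essential for the measurements.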


5.2 QUANTUM LIGHT 


Lummer and Pringsheim [87] at PTR found that Wien’s law fitted their 
observed spectrum rather well between T ≈ 800 K and 1400 K and wavelengths
between 1 μm and 6 μm, but also observed systematic deviations. Enlarging
the temperature and wavelength range to 1650 K and 8.3 μm, respectively, the
deviations did not go away [88]. Figure 5.1 shows these results. The observed 
spectrum lies systematically above Wien’s law for long wavelengths and high 
temperatures. However, data obtained by Friedrich Paschen and Hans Wanner 
supported Wien’s law [90, 89, 100] in the visible and near infrared range, for 
low temperatures. 


Figure 5.1 The spectral radiance of a black body as a function of wavelength
for different temperatures [88]. The values measured by Lummer and Pringsheim
(solid line) are compared to Wien’s law (dashed line). Systematic deviations
are observed for long wavelengths and high temperatures. (Credit:
Verhandlungen der DPG, 1899)

A breakthrough came in 1900 [188], when Lummer and Pringsheim
obtained a sylvine prism (potassium chloride KCl) and extended the wavelength
range of their experiments to long wavelengths, 12 to 18 µm, and even
higher temperatures, exceeding 1770 K [94]. The wavelength range was further
extended by Heinrich Rubens and Kurlbaum [101] using the so-called rest- 
strahlen (residual ray) effect. Their method profited from the fact that cer- 
tain materials totally reflect a certain narrow band of wavelengths. It allowed 
A distribution law describes the way that a measurable quantity, velocity,
energy, wavelength or frequency, is distributed among the microscopic ingredients
of a thermodynamic system. The archetype is the Maxwell-Boltzmann
distribution of the energy of rigid particles, scattering elastically off each other.
The probability f of finding a particle with energy between $\epsilon$ and $\epsilon + d\epsilon$ in a
given degree of freedom of the particles is:

$$f(\epsilon, T)\,d\epsilon = \frac{1}{\sqrt{\pi \epsilon k T}}\, e^{-\epsilon/kT}\, d\epsilon$$

Here T is the temperature and k denotes Boltzmann’s constant. Note that 
this is a marginal probability, applicable to this particular degree of freedom 
when any energy is admitted for the others. All other thermal energy distri- 
butions have the Maxwell-Boltzmann distribution as their asymptote if the 
temperature is sufficiently high and the density sufficiently low. 

If these conditions are not met, one finds two general cases: the Fermi-Dirac 
distribution for fermions, i.e. particles with half-integer spin like the electron; 
the Bose-Einstein distribution for bosons, i.e. particles with integer spin like 
the photon. The latter distribution has a special case relevant for black body 
radiation. Since that consists of a photon gas, the ingredients cannot scatter 
off each other. If there is interaction only with the walls of the container, one 
obtains Planck’s radiation law:

$$B(\lambda, T) = \frac{2hc^2}{\lambda^5}\, \frac{1}{e^{hc/(\lambda k T)} - 1}$$

It describes the spectral radiance B of the gas, i.e. the radiated power per 
unit area, solid angle and wavelength. The law has two limiting cases. For the 
limit of low frequencies, Planck’s law tends towards the Rayleigh-Jeans law: 


$$B_{RJ}(\lambda, T) = \frac{2ckT}{\lambda^4}$$


In the limit of high frequencies Planck’s law reduces to the Wien approximation:

$$B_W(\lambda, T) = \frac{2hc^2}{\lambda^5}\, e^{-hc/(\lambda k T)}$$

The wavelength at the maximum of this spectrum is inversely proportional to temperature,
$\lambda_{max} = (0.288/T)$ cm, for T in Kelvin, a fact which is also called Wien’s displacement
law.


Focus Box 5.1: Dictionary of radiation laws 
a further extension of the measurements up to more than 50 µm wavelengths.
In the meantime Lord Rayleigh had “ventured to suggest a modification” [98] 
to Wien’s law, suggesting that the spectral radiance be proportional to tem- 
perature at long wavelengths. This was indeed in agreement with Rubens’ and 
Kurlbaum’s results. 
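
To make the comparison in Focus Box 5.1 concrete, the following Python sketch evaluates Planck’s law and its two limiting forms numerically. It is an illustration only: the function names, the temperature and the sample wavelengths are assumptions chosen here to echo the ranges quoted above, not values taken from the original measurements.

```python
import math

# Physical constants in SI units
H = 6.62607015e-34   # Planck's constant, J s
C = 2.99792458e8     # speed of light, m/s
K = 1.380649e-23     # Boltzmann's constant, J/K


def planck(lam, temp):
    """Planck spectral radiance per unit wavelength, W / (m^2 sr m)."""
    x = H * C / (lam * K * temp)
    return 2.0 * H * C**2 / lam**5 / math.expm1(x)


def rayleigh_jeans(lam, temp):
    """Long-wavelength (low-frequency) limit of Planck's law."""
    return 2.0 * C * K * temp / lam**4


def wien(lam, temp):
    """Short-wavelength (high-frequency) limit of Planck's law."""
    x = H * C / (lam * K * temp)
    return 2.0 * H * C**2 / lam**5 * math.exp(-x)


def wien_peak(temp):
    """Wavelength maximising the Wien approximation: h c / (5 k T), in metres."""
    return H * C / (5.0 * K * temp)


if __name__ == "__main__":
    T = 1650.0  # K, illustrative temperature (assumption)
    for lam_um in (1.0, 8.3, 50.0):
        lam = lam_um * 1e-6
        b = planck(lam, T)
        print(f"{lam_um:5.1f} um  Wien/Planck = {wien(lam, T) / b:.3f}  "
              f"RJ/Planck = {rayleigh_jeans(lam, T) / b:.3f}")
    print(f"Wien peak at {T:.0f} K: {wien_peak(T) * 1e6:.2f} um")
```

Run as a script, it prints the ratio of each approximation to Planck’s law. Wien’s approximation falls below Planck’s law as the wavelength grows, in line with the deviations reported by Lummer and Pringsheim, while the Rayleigh-Jeans form only becomes adequate at the longest wavelengths.
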

Meanwhile, Max Planck followed a completely different scientific pro- 
gram. In an article describing the winding road to his radiation law [298], 
he states: “What has always interested me in physics were the great universal
laws, which are important for all natural processes, regardless of the proper- 
ties of intervening bodies, and of the ideas one forms about their structure.” 
He therefore took a special interest in the laws of thermodynamics. Again 
according to his own judgement, he was particularly inspired by the writings 
of Clausius, “due to their excellent clarity and persuasiveness of language.” 
Entropy became the subject of his doctoral thesis [44] in Munich in 1879. 
After a first appointment in Kiel, in 1889 he was named the successor to 
Kirchhoff at the Friedrich-Wilhelms-Universität in Berlin. Starting in 1895,
he published a series of papers in the reports of the Berlin Academy, docu- 
menting the progress of his work. The aim was to prove the second law of 
thermodynamics, based on the kinetic theory of heat. His working model was 
a black body cavity, with harmonic oscillators inside. He managed to prove 
Wien’s approximation this way, but further progress was limited. When he 
learned about Lummer’s and Pringsheim’s findings of deviations from Wien’s 
law of radiation and Rubens told him about the long wavelength behaviour of 
black body radiation, he approached the problem again starting with entropy 
(see Focus Box 5.2). That led him to an empirical radiation law interpolating 
between Wien’s approximation and Rayleigh’s conjecture. But how could the 
constants appearing in the law be calculated based on first principles? 

This problem made him surmount his aversion to statistical thermodynam- 
ics and use Boltzmann’s approach, which formulates entropy as a measure of 
the probability of a macroscopic state in terms of microscopic ingredients. 
Following this line, as sketched in Focus Box 5.2, he found that there was
indeed a finite number of small portions of energy, quanta, which make up
the internal energy of the system. The energy of a light beam is thus not
continuous, but comes in quanta of $h\nu$, proportional to the frequency $\nu$ of the
light with Planck’s constant h as the factor of proportionality. He presented his
findings to the German Physical Society in December of 1900 [97], as shown in the
protocol of their session, Figure 5.2. It was published in 1901 [104].

This marks the birthdate of quantum physics. Arnold Sommerfeld [194, 
p. 36] wrote in hindsight: “The quantum theory is a product of the twentieth 
century. It came to life on 14th December, 1900.” But according to Helge 
Kragh [521], “if a revolution occurred in physics in December 1900, nobody 
seemed to notice.” The real importance of energy quantisation only came 
to light with Einstein’s interpretation of the photoelectric effect [120], which 
linked black body radiation to matter properties. 
The second law of thermodynamics, in the absence of external work, states
how entropy varies with internal energy, $dS/dU = 1/T$. Proportionality of the
energy with temperature, as suggested by Rayleigh, thus meant
$d^2S/dU^2 = \mathrm{const}/U^2$. Wien’s approximation for short wavelengths, on the other hand,
means that $d^2S/dU^2 = \mathrm{const}/U$. A radiation law interpolating the two would
have:

$$\frac{d^2S}{dU^2} = \frac{\mathrm{const}}{U(U + \mathrm{const})} \quad \Rightarrow \quad B(\lambda, T) = \frac{c_1}{\lambda^5}\, \frac{1}{e^{c_2/(\lambda T)} - 1}$$



Planck communicated this result to Rubens the same day on a post card [188]. 
It described the experimental data very well, especially improved later mea- 
surements [189], but it was not based on first principles, as Planck intended. 
He had so far ignored Boltzmann’s statistical interpretation of entropy, but 
was now forced to use his approach, “since no other way out opened” [298]. 
This required counting the microstates that made the distribution. Instead 
of only one oscillator inside the cavity, he now based his calculation on a 
large number n of oscillators. The total internal energy U is also decomposed 
into p equal portions $\epsilon$. Boltzmann’s number N of microstates forming the
macroscopic state can then be counted by combinatorics:

$$N = \frac{(n + p - 1)!}{(n - 1)!\, p!}$$

The interpolating radiation law comes out for $\epsilon = h\nu$, such that $c_1 = 2hc^2$ and
$c_2 = hc/k$ with Boltzmann’s constant k and the speed of light c. Planck’s
constant h has the dimension of a product of energy and time, he thus named 
it a quantum of action. The energy of the radiation in a black body is thus 
not continuous, it is quantised in tiny portions proportional to the frequency 
of the radiation. 


Focus Box 5.2: Planck’s radiation law and entropy 
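
The counting argument of Focus Box 5.2 can be checked numerically. The sketch below assumes the standard combinatorial count of the ways to distribute p energy portions among n oscillators, $(n + p - 1)!/((n - 1)!\,p!)$, takes the entropy as $S = k \ln N$ and obtains the temperature from $1/T = dS/dU$ with $U = p\epsilon$. Units with $k = 1$ and $\epsilon = 1$, the function names and the oscillator numbers are choices made here for illustration.

```python
import math


def ln_microstates(n, p):
    """ln of (n + p - 1)! / ((n - 1)! p!): ways to share p quanta among n oscillators."""
    return math.lgamma(n + p) - math.lgamma(n) - math.lgamma(p + 1)


def temperature(n, p, eps=1.0, k=1.0):
    """T from 1/T = dS/dU, with S = k ln N and U = p * eps (central difference in p)."""
    ds = k * (ln_microstates(n, p + 1) - ln_microstates(n, p - 1))
    return 2.0 * eps / ds


def planck_mean_energy(temp, eps=1.0, k=1.0):
    """Mean oscillator energy according to Planck: eps / (exp(eps / kT) - 1)."""
    return eps / math.expm1(eps / (k * temp))


if __name__ == "__main__":
    n = 100_000  # number of oscillators (illustrative assumption)
    for p in (10_000, 100_000, 1_000_000):
        t = temperature(n, p)
        print(f"counted U/n = {p / n:8.4f}   T = {t:8.4f}   "
              f"Planck U/n = {planck_mean_energy(t):8.4f}")
```

For large n the counted energy per oscillator and Planck’s expression agree closely, which is the statement that the interpolating law follows from counting discrete energy portions.
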
Figure 5.2 A copy of the protocol for the session of the German Physical
Society DPG on December 14, 1900 [517]. It records two contributions by 
Max Planck, “About the so-called Wien paradox”, and “On the theory of 
the energy distribution law in the normal spectrum.” (Credit: Archiv der 
Deutschen Physikalischen Gesellschaft, Signatur 10008) 


Einstein’s heuristic argument goes as follows. If the light inside the black 
body cavity cannot interact with itself, it must be the reflection at the walls 
which establishes the thermal equilibrium. According to Planck’s law, the 
energy stored in the light comes in quanta, tiny portions which can be counted. 
In Einstein’s words [120]: “the energy of a light beam sent out from a point is 
not distributed to larger and larger volumes, but it consists of a finite number 
of energy quanta, localised in spatial points, which move without splitting and 
can only be absorbed or created as a whole.” This means that the atoms in 
the walls can emit or absorb quanta of a definite energy. In his seminal paper 
Einstein first demonstrates Planck’s radiation law based on this evidence. He 
then makes the connection to the photoelectric effect (see Focus Box 4.3). 
There, the same principle applies: “Quanta penetrate into the surface layer of 
the [cathode] body, and their energy is transmitted at least in part into kinetic 
energy of electrons.” This explains why the energy of cathode rays
emitted by the photoelectric effect increases linearly with the frequency of the
light shining on the cathode. Their kinetic energy must be equal to $(h\nu - E_b)$,
where $E_b$ is the binding energy of electrons in the wall. The quantisation
hypothesis also explains why the photoelectric current increases with light
intensity: more quanta eject more electrons. Einstein elaborates on this in a
paper from 1906 [122]. He states³: “The energy of an elementary resonator can
only take values which are integer multiples of [$h\nu$]; the energy of a resonator
changes in a leap by absorption or emission, equally by an integer multiple of
[$h\nu$].” This statement formulates the link between the quantum behaviour of
light and the quantum behaviour of matter.
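
As a small numerical illustration of Einstein’s relation for the photoelectric effect, the sketch below computes the maximum kinetic energy of an ejected electron as photon energy minus binding energy. The function name and the binding energy of 2.3 eV in the example are hypothetical values chosen for illustration, not numbers from the text.

```python
H_EV = 4.135667696e-15   # Planck's constant in eV s
C = 2.99792458e8         # speed of light in m/s


def max_kinetic_energy_ev(wavelength_m, binding_ev):
    """Kinetic energy h*nu - E_b in eV; 0.0 if the photon energy is below E_b."""
    photon_ev = H_EV * C / wavelength_m
    return max(0.0, photon_ev - binding_ev)


if __name__ == "__main__":
    binding = 2.3  # eV, hypothetical binding energy
    for lam_nm in (300, 400, 500, 700):
        e_kin = max_kinetic_energy_ev(lam_nm * 1e-9, binding)
        print(f"{lam_nm} nm -> {e_kin:.3f} eV")
```

The light intensity does not enter: in this picture it only changes how many electrons are ejected, not the energy of each one.
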

That light consists of particles with definite energy and momentum became 
clear by the work of the American physicist Arthur H. Compton in 1923 [192]. 
He scattered monochromatic X-rays off a graphite target and found that the 
wavelength of the outgoing rays was increased with respect to the incoming 
one. He interpreted this as an inelastic scattering of light particles, photons, off
electrons, as explained in Focus Box 5.3. In this process, photons behave as
single particles with energy $h\nu$ and momentum $h\nu/c$.

So light has two faces. On the one hand, it is a Maxwellian electromag- 
netic wave, propagates with a constant speed in all reference systems and can 
interfere with other waves (see Focus Box 5.10). On the other hand, it is a 
particle interacting elastically with atoms and its energy is quantised. And so 
are the energy levels of atoms. 


5.3 QUANTUM ATOMS 


Rutherford had refrained from formulating a theory of how electrons fill the 
atomic volume around the compact nucleus, only making vague reference to 
Saturnian models à la Nagaoka. Early endeavours to add small numbers of
electrons to a central nucleus using Planck’s quanta, e.g. by Arthur Erich 
Haas [133] and John William Nicholson [143], failed since they were stuck with 
the idea that frequencies of revolution or axial vibration had to be proportional 
to the frequencies of the emitted light. 

In came Niels Bohr. He received his doctorate at the University of Copen- 
hagen in 1911 with a thesis on the electron theory of metals, which followed 
and improved on ideas of J.J. Thomson, treating metals essentially as a gas of 
quasi-free electrons. Endowed with a fellowship from the Carlsberg foundation, 
he went to Cambridge in September of 1911 and joined the Cavendish Laboratory,
with the idea of translating his thesis into English and profiting from advice by
Thomson. His state of mind is rather well documented in numerous letters,
mostly to his fiancée, future wife and lifetime love Margrethe Nørlund [594].


3In the spirit of this book I have converted Einstein’s notation into the modern one for 
Planck’s quantum of action h. 
In the early 1920s, Arthur Compton scattered
monochromatic X-rays off electrons. The kinematics
of the process is shown on the left. The incoming
photon has an energy of $E_\gamma = h\nu$ and a momentum
$p_\gamma = h\nu/c$. The outgoing one leaves under an angle $\theta$
with $E'_\gamma = h\nu'$ and $p'_\gamma = h\nu'/c$. The outgoing energy is
not the same as the incoming because of the electron
recoil.


The electron (mass m) is initially at rest, its momentum is negligible compared 
to that of the photon. It leaves with energy E and momentum p. Energy 
conservation in the process means: 


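$$E_\gamma + mc^2 = E + E'_\gamma \quad ; \quad h\nu + mc^2 = E + h\nu'$$

$$E^2 = m^2c^4 + p^2c^2 \quad ; \quad p^2c^2 = (h\nu - h\nu' + mc^2)^2 - m^2c^4$$

Momentum conservation means:

$$\vec{p} = \vec{p}_\gamma - \vec{p}\,'_\gamma \quad ; \quad p^2 = \left(\frac{h\nu}{c}\right)^2 + \left(\frac{h\nu'}{c}\right)^2 - 2\,\frac{h^2}{c^2}\,\nu\nu'\cos\theta$$

Using $\nu\lambda = \nu'\lambda' = c$, we find for the loss in frequency and the increase in
wavelength:

$$\frac{\nu - \nu'}{\nu\nu'} = \frac{h}{mc^2}\,(1 - \cos\theta) \quad ; \quad \lambda' - \lambda = \frac{h}{mc}\,(1 - \cos\theta)$$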


Focus Box 5.3: Compton scattering 
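
The result of Focus Box 5.3 is easy to evaluate. The following sketch computes the wavelength shift $h/(mc)\,(1 - \cos\theta)$ for a few scattering angles; the function names and the incoming wavelength of 0.071 nm are assumptions made here for illustration.

```python
import math

H = 6.62607015e-34       # Planck's constant, J s
C = 2.99792458e8         # speed of light, m/s
M_E = 9.1093837015e-31   # electron mass, kg


def compton_shift(theta):
    """Wavelength increase h/(m c) * (1 - cos(theta)) in metres."""
    return H / (M_E * C) * (1.0 - math.cos(theta))


def scattered_wavelength(lam, theta):
    """Outgoing wavelength for an incoming wavelength lam (metres)."""
    return lam + compton_shift(theta)


if __name__ == "__main__":
    lam_in = 0.071e-9  # m, illustrative X-ray wavelength (assumption)
    for deg in (0, 45, 90, 135, 180):
        lam_out = scattered_wavelength(lam_in, math.radians(deg))
        print(f"theta = {deg:3d} deg   lambda' = {lam_out * 1e9:.5f} nm")
```

The shift does not depend on the incoming wavelength and is of the order of a few picometres, which is why X-rays rather than visible light are needed to observe it.
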
Thomson listened to him politely but was not interested in revisiting his former
subject of study. John Heilbron [430] identifies their difference in working
style as one of the reasons. Bohr needed lengthy discussions to elaborate
his ideas, often ending in monologue, while Thomson hardly involved even 
his close collaborators in the formation of his thinking. Thus Bohr accepted 
an invitation to Manchester in March of 1912 to join Rutherford’s labora- 
tory. There, radioactivity was the subject of the period. Bohr took courses 
and was assigned laboratory work, which he diligently executed. According to 
Michael Eckert [598], his attention was drawn to atomic theory by a study of 
multiple Coulomb scattering and energy loss of α particles when traversing
matter [149]. 

In the summer of that year, Bohr drew up what is known as the “Ruther- 
ford Memorandum”. It mainly addressed the question of mechanical stability 
à la Thomson, ignoring radiative instability. But it also contained first hints
at his atomic model, introducing the “special hypothesis” that the kinetic 
energy of an electron be proportional to its revolution frequency. Bohr also 
thought he had a solution for the problem of periodicity in atoms, as exhib- 
ited by Mendeleev’s table [430]. The rest of the memorandum addressed simple 
molecules and chemical binding. He did not argue in terms of quanta, but the 
work of Nicholson put him on the track. In March of 1913 he sent the first part 
of his famous 1913 trilogy of papers [146, 147, 148] to Rutherford, in which 
he set out what we call today Bohr’s atomic model. It boldly left behind the 
classical treatment of the problem by what one can call heuristic axioms: 


• Not all classical orbits are available to the electron, but only those called
stationary, where it does not radiate electromagnetic energy. The stationary
states can be numbered by a principal quantum number n; n = 1
denotes the ground state of the atom. A specific energy $E_n$ belongs to
each such state.


• The electron can jump from one stationary state i to another j. This
“quantum leap” is outside the realm of classical physics. When the electron
undergoes this process, energy is conserved by emitting or absorbing
an electromagnetic wave, with a frequency $\nu$ such that its energy,
$h\nu = E_i - E_j$, corresponds to the energy difference between the two
stationary states.


It is thus not the frequency or energy of the orbiting electron, but the energy 
difference between the initial and the final stationary states which determines 
the frequency of the light. These hypotheses lead to the Rydberg formula for 
the Balmer series when one makes one of two additional assumptions. One 
hypothesis, sometimes called the third Bohr postulate, requires that in addi- 
tion to the energy, also the angular momentum of the electron’s orbit comes 
in quanta (see Focus Box 5.4). The other and more far reaching hypothesis is 
Bohr’s correspondence principle [605]. Roughly speaking it requires that for
large principal quantum numbers, the orbit tends towards a classical one [183].
What Bohr exactly meant by this is debatable, but the general requirement
that a new theory must have its successful predecessors as a limiting case is 
not. 


Bohr’s model of the hydrogen atom put a heavy point-like nucleus with charge
+e in the centre, surrounded by an orbiting electron of charge −e and mass m.
Classically, the Coulomb force keeps it on an orbit of radius r:

$$\frac{e^2}{4\pi\epsilon_0}\,\frac{1}{r^2} = \frac{mv^2}{r}$$

Multiplying this equation with $mr^3$, the right side of the equation becomes
the square of the modulus of the electron’s angular momentum $\vec{L} = \vec{r} \times \vec{p}$:

$$\frac{e^2}{4\pi\epsilon_0}\, mr = (mvr)^2 = L^2$$

Applying the quantisation requirement that L can only take values which are
a multiple of $\hbar = h/(2\pi)$, one obtains a value for the radii of stationary orbits:

$$r_n = n^2\, \frac{4\pi\epsilon_0\hbar^2}{me^2}$$
The total energy of the stationary electron can be calculated using the virial
theorem:

$$E_n = \frac{1}{2}E_{pot} = -\frac{1}{2}\,\frac{e^2}{4\pi\epsilon_0}\,\frac{1}{r_n} = -\frac{1}{2}\left(\frac{e^2}{4\pi\epsilon_0}\right)^2 \frac{m}{\hbar^2}\,\frac{1}{n^2}$$

The energy thus also comes out quantised. The magnitude of the coefficient of the
$1/n^2$ term is called the Rydberg energy, it is proportional to the Rydberg
constant, $E_R = hcR_y$. The energy difference between two energy levels n and m is thus:

$$\Delta E = \frac{hc}{\lambda} = E_m - E_n = E_R\left(\frac{1}{n^2} - \frac{1}{m^2}\right)$$
with n = 1,2,3,... and m = n+1,n +2,n +3,... For n = 2 we thus recover 
the Balmer series of hydrogen spectral lines (see Section 4.2), for n = 1 and 
n > 2 the other spectral series (named after Lyman, Paschen, Brackett, Pfund 
and Humphreys). 


Focus Box 5.4: Bohr atom and spectral lines 
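
The formulas of Focus Box 5.4 can be evaluated directly from natural constants. The sketch below does so for an infinitely heavy nucleus; the function names are choices made here, and the small correction for the finite proton mass is deliberately ignored.

```python
import math

EPS0 = 8.8541878128e-12      # vacuum permittivity, F/m
E_CHARGE = 1.602176634e-19   # elementary charge, C
M_E = 9.1093837015e-31       # electron mass, kg
H = 6.62607015e-34           # Planck's constant, J s
HBAR = H / (2.0 * math.pi)
C = 2.99792458e8             # speed of light, m/s

COULOMB = E_CHARGE**2 / (4.0 * math.pi * EPS0)   # e^2 / (4 pi eps0), J m


def orbit_radius(n):
    """Radius of the n-th stationary orbit in metres."""
    return n**2 * HBAR**2 / (M_E * COULOMB)


def rydberg_energy():
    """Rydberg energy E_R in joules."""
    return 0.5 * COULOMB**2 * M_E / HBAR**2


def energy_level(n):
    """Energy of the n-th stationary state in joules (negative: bound)."""
    return -rydberg_energy() / n**2


def line_wavelength(n, m):
    """Wavelength in metres of the transition between levels m > n."""
    return H * C / (energy_level(m) - energy_level(n))


if __name__ == "__main__":
    print(f"r_1 = {orbit_radius(1) * 1e12:.2f} pm")
    print(f"E_R = {rydberg_energy() / E_CHARGE:.3f} eV")
    for m in (3, 4, 5, 6):   # first lines of the Balmer series, n = 2
        print(f"m = {m}: {line_wavelength(2, m) * 1e9:.1f} nm")
```

With n = 2 the computed wavelengths fall in the visible range, as expected for the Balmer series.
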


Bohr’s approach to atomic theory obviously breaks with classical physics 
in a bold way. None of his axiomatic hypotheses is compatible with classical 
electrodynamics and mechanics, nor rooted in an underlying new theory. Heil- 
bron [163] traces back Bohr’s boldness to the philosophy of Harald Høffding 
and William James. Høffding breaks down truth to an analogy between actual 
reality and its image in the human mind. There are multiple partial truths,
which justify themselves by the fact that they work. In James’ words quoted by
Heilbron: “The true is the name of whatever proves itself to be good [that is,
useful and thus valuable]⁴ in the way of belief.” Thus two practical arguments
justify abandoning the classical approach: the results explain the observed 
line spectra and the Rydberg constant comes out correctly in terms of known 
natural constants. 


Figure 5.3 Physicists and chemists at a gathering in Berlin-Dahlem in 1920 in 
honour of James Franck’s appointment to a professorship at the University of 
Göttingen [672]. Back row, left to right: Walter Grotrian, Wilhelm Westphal,
Otto von Baeyer, Peter Pringsheim, Gustav Hertz. Front row, left to right: 
Hertha Sponer, Albert Einstein, Ingrid Franck, James Franck, Lise Meitner, 
Fritz Haber, Otto Hahn. (Credit: Archiv der Max-Planck-Gesellschaft, Berlin- 
Dahlem) 


Strong support for Bohr’s theory of distinct energy levels for electrons was
given by the 1914 experiments of James Franck and Gustav Hertz,
the nephew of Heinrich Hertz (see Figure 5.3). They demonstrated energy
levels in mercury, using inelastic electron scattering, as explained in
Focus Box 5.5⁵. The experiment demonstrates the existence of discrete energy levels


4Explanatory comment by Heilbron.
5The experiment and its results are thoroughly analysed in modern language by Robert
E. Robson, Malte Hildebrandt and Ronald D. White
in mercury atoms. Inelastic collisions of electrons transfer just the energy to
atomic electrons which is required to pass from one energy level to another. 
When they fall back to the ground state, light is emitted with the same energy 
that it took to excite them. 
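
A short numerical sketch may help to connect the excitation energy to what is observed. It assumes the mercury excitation energy of 4.9 eV discussed in Focus Box 5.5 and an idealised tube in which contact potentials and the small counter-voltage are neglected; the function names are hypothetical.

```python
HC_EV_NM = 1239.841984   # h * c in eV nm


def emission_wavelength_nm(excitation_ev):
    """Wavelength of the light emitted when the atom returns to its ground state."""
    return HC_EV_NM / excitation_ev


def dip_voltages(excitation_ev, max_voltage):
    """Idealised grid voltages (in V) at which the anode current drops:
    integer multiples of the excitation energy divided by the electron charge."""
    count = int(max_voltage // excitation_ev)
    return [round(i * excitation_ev, 6) for i in range(1, count + 1)]


if __name__ == "__main__":
    print(f"{emission_wavelength_nm(4.9):.1f} nm")
    print(dip_voltages(4.9, 15.0))
```

The emitted light lies in the ultraviolet, and the current dips repeat each time the electrons have gained enough energy for one more inelastic collision on their way to the grid.
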

The period of World War I, from 1914 to 1918, the ensuing isolation of 
Germany and economic and political problems of the Weimar Republic clearly 
reduced scientific output. Nevertheless, throughout the first two decades of 
the 20th century, many problems in atomic spectroscopy were attacked using 
quantum theory à la Bohr. The theory was given more solid mathematical 
footing by the work of Arnold Sommerfeld [173]. He generalised Bohr’s circu- 
lar stationary orbits, numbered by the main quantum number n, to elliptical 
ones, with additional quantum numbers l and m, which he called azimuthal
and magnetic quantum numbers. They number the quantum states of orbital 
angular momentum and its orientation along an arbitrary axis. Spectroscopy 
thus defined a shell structure of atomic energy levels [178], with closely spaced 
levels sharing the same n. This structure was confirmed by the X-ray scat- 
tering experiments of Charles Barkla [138] and Henry Moseley [158]. Angular 
momentum states, which can be identified by the modulus of an orbit’s angular
momentum, can vary from l = 0 to l = n − 1. Their orientation, which
can be thought of as the projection of the angular momentum vector onto an
arbitrary axis, varies from m = −l to m = +l. Sommerfeld’s book “Atomic
Structure and Spectral Lines” [181] concluded what is now called the “old” 
quantum theory. The angular momentum quantum numbers explain the line 
splitting in a magnetic field, the (normal) Zeeman effect. Paul Epstein [170] 
and Karl Schwarzschild [172] independently derived equations for the Stark 
effect in hydrogen. 

However there remained substantial flaws in the old quantum theory. Per- 
haps the most important one was that it provided no means to calculate the 
intensities of the spectral lines, i.e. the probability that excitation and
de-excitation happen. It also failed to explain the anomalous Zeeman effect,
where spin comes in. 

Spatial quantisation was clearly another quantum effect at odds with classical
electrodynamics, in which an angular momentum vector can make any
angle with an external axis, e.g. defined by the direction of a magnetic field.
In the search to confirm that it indeed existed, electron spin was accidentally
discovered in the famous Stern-Gerlach experiment (see Focus Box 5.6).
Otto Stern had worked with Einstein in Prague and Zürich and became assis- 
tant to Max Born at the Frankfurt Institute for Theoretical Physics after the 
first world war ended. Before the war, he had learned about atomic beams 
formed by effusion of heated substances under vacuum⁶. Walter Gerlach had
worked with Wilhelm Wien during the war to develop wireless telegraphy. He 
joined the Frankfurt Institute for Experimental Physics, adjacent to Born’s 
institute, in 1920. The two got together to develop a method to prove or 


6For a review of the history of molecular beams see [423]. 
The Franck-Hertz experiment used an evacuated glass tube filled with rarefied
mercury vapour, sketched above on the left. Electrons are emitted by the 
heated cathode on the left, and accelerated by a variable voltage V towards 
the middle grid. A small counter-voltage keeps low energy electrons passing 
the grid from reaching the anode on the right. The current A between grid and 
anode is measured by an ammeter. Its dependence on the grid voltage is
shown schematically in the right figure. 

Electrons on average have several hundred elastic interactions with mercury 
atoms on their way to the grid, the electron beam is thus diffused in direc- 
tion, but the electrons keep their energy. The current first increases with grid 
voltage. When the grid voltage approaches 4.9 V, the current drops sharply.
This indicates that electrons of kinetic energy 4.9 eV interact with mercury
atoms in an inelastic collision and lose most of their energy. At first Franck
and Hertz thought that this energy was the ionisation energy of mercury. It later
became clear that this energy corresponds to the preferred transition of outer
electrons ($6^1S_0 \to 6^3P_1$) [613]. When one increases the grid voltage further, the
anode current increases again up to a maximum at 9.8 V. This means that now
the electrons have enough energy for inelastic collisions half way to the grid, 
their kinetic energy is lost there instead of close to the grid. The remaining 
potential difference accelerates them again, up to the necessary kinetic energy 
for inelastic collisions close to the grid. 

The zones of inelastic scattering can be made visible when using neon gas 
instead of mercury vapour. The neon excitation energy is approximately 
18.5 eV, a weighted average of several transitions [613]. When the neon atom
falls back to its ground state, it emits visible light. The zone of inelastic scat- 
tering becomes visible as a faintly glowing slice close to the grid. When the 
accelerating voltage is increased, the zone wanders towards the cathode. At 
twice the grid voltage, when the zone has reached half the cathode-grid dis- 
tance, a second glowing zone appears close to the grid and the anode current 
again reduces abruptly. 


Focus Box 5.5: The Franck-Hertz experiment 
disprove space quantisation, despite everyone’s and especially Born’s scepticism.
Atomic beams are key to the test, because they have no net charge and
are not subject to Lorentz forces, but interact with the magnetic field via their 
magnetic moment only. The critical experimental obstacle was to produce a 
sufficiently strong gradient in a magnetic field to make the small magnetic 
moment visible. In a feasibility study [191], Stern had concluded that even a 
gradient at the limit of feasibility would only result in a minute deflection of 
the beam. 
Figure 5.4 Gerlach’s postcard to Niels Bohr dated February 8, 1922. It shows
a photo of the split silver beam deposit, with a short statement congratulating 
Bohr on the confirmation of his theory of spatial quantisation. (Credit: Niels 
Bohr Archive P006) 


After many tries and improvements to their apparatus and methods, in
1922 they succeeded in observing the magnetic deflection of silver atoms. A
triumphant postcard, which is reproduced in Figure 5.4, was sent to Niels
Bohr [542]. It was first believed that they had demonstrated the quantisation
of orbital angular momentum direction, which is not possible since the ground
state of silver atoms has none. It took new ideas by George Uhlenbeck and
Samuel Goudsmit to find out that they had really discovered the electron spin.
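
To get a feeling for how small the deflection in the Stern-Gerlach experiment is, one can insert plausible numbers into the deflection formula of Focus Box 5.6, force divided by mass times the squared transit time over two. All numerical inputs below (a field gradient of 1000 T/m, a magnet length of 3.5 cm, an oven temperature of 1000 °C and the use of the root-mean-square thermal speed) are assumptions made here for an order-of-magnitude estimate, not the parameters of the original apparatus.

```python
import math

MU_B = 9.2740100783e-24      # Bohr magneton, J/T
K = 1.380649e-23             # Boltzmann's constant, J/K
U = 1.66053906660e-27        # atomic mass unit, kg


def thermal_speed(mass_kg, temp_k):
    """Root-mean-square thermal speed sqrt(3 k T / m) in m/s."""
    return math.sqrt(3.0 * K * temp_k / mass_kg)


def deflection(mu, mass_kg, gradient, length, speed, cos_theta=1.0):
    """Vertical deflection mu / (2 m) * dB/dz * (length / speed)^2 * cos(theta)."""
    return mu / (2.0 * mass_kg) * gradient * (length / speed) ** 2 * cos_theta


if __name__ == "__main__":
    m_ag = 107.87 * U                      # mass of a silver atom
    v = thermal_speed(m_ag, 1273.15)       # assumed oven temperature of 1000 degrees C
    dz = deflection(MU_B, m_ag, 1000.0, 0.035, v)   # assumed gradient and length
    print(f"speed = {v:.0f} m/s, deflection = {dz * 1e3:.2f} mm")
```

With these assumed inputs the deflection comes out at a fraction of a millimetre, which illustrates why the field gradient was the critical experimental obstacle.
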

George Uhlenbeck was a man with wide interests in languages, history, 
mathematics and physics. Not being admitted to University because of a lack 
of Latin and Greek, he started his scientific education at the Delft Institute 
of Technology. When the law changed, he joined the prestigious University of 
Leiden in 1919, where he received a scholarship. After graduating in 1920, he 
attended courses with the charismatic Paul Ehrenfest and Hendrik Lorentz. 
After an episode as a private instructor in Rome, where he made friends with 
Enrico Fermi, he seriously hesitated between a career in physics or in his- 
tory. He was advised to first finish his advanced physics studies and became 
An atomic electron which has total angular momentum $\vec{J}$ carries a magnetic
moment $\vec{\mu}$, proportional to the Bohr magneton $\mu_B$:

$$\vec{\mu} = \frac{e}{2m_e}\,\vec{J} \quad ; \quad \mu_B = \frac{e}{2m_e}\,\hbar$$

The second equation assumes that the angular momentum is quantised in
units of $\hbar$ as predicted by the Bohr-Sommerfeld model. The atom passes horizontally
through an inhomogeneous magnetic field, which has a vertical inhomogeneity
$\partial B/\partial z$ as schematically shown below. The atom will experience a
force $\mu(\partial B/\partial z)\cos\theta$, where $\theta$ denotes the angle between $\vec{\mu}$ and $\partial B/\partial z$. It will
thus suffer a deflection $\Delta z$ in the vertical direction:

$$\Delta z = \frac{\mu}{2m}\left(\frac{\partial B}{\partial z}\right)\left(\frac{\Delta x^2}{v^2}\right)\cos\theta$$

where m is the atomic mass, $\Delta x$ is the length of the deflecting field region
and v is the velocity of the atom. Classically, the beam would just be spread
continuously from $-\Delta z$ to $+\Delta z$, since the value of $\cos\theta$ is arbitrary. According
to Sommerfeld’s space quantisation, the possible angular orientations have
$\cos\theta = m_l/l$, where l is the orbital angular momentum and $m_l$ is the magnetic
quantum number. For l = 1 and $m_l = -1, 0, +1$, the beam should thus split
in two with deviations $-\Delta z$ and $+\Delta z$ only.


Starting in 1921, Stern and Gerlach set up a series of experiments schemat- 
ically shown above. An oven (1) emits a beam of silver atoms at 1000°. It 
passes through a pair of collimators (2) in pencil shape and goes through a 
strongly inhomogeneous magnetic field (3). It is deposited on a plate, either 
as a straight line (4, classical case), or as two symmetric dots (5, quantised 
case). The latter was observed by Stern and Gerlach when photographically 
developing the plate after many unsuccessful tries in early 1922. They and 
their contemporaries thought the deflection was due to the spatial quantisation
of orbital angular momentum. In reality, the total angular momentum
in Ag atoms solely comes from the spin $\vec{S}$ of its outermost 5s electron. It
has l = 0, thus $m_l = 0$ in Sommerfeld’s notation. However, the total angular
momentum is not zero, but $\vec{J} = \vec{L} + \vec{S} = \vec{S}$, with a component along the
magnetic field of $m_s = \pm 1/2$. The spin causes a magnetic moment $\vec{\mu}_s = g\mu_B\vec{S}$
with the Landé factor g ≈ 2.


Focus Box 5.6: Stern-Gerlach experiment 
Ehrenfest’s doctorandus. His first assignment was to learn about atomic spectroscopy
from Samuel Goudsmit, a master’s student in Leiden. In summer of
1925 they started to work together. According to Abraham Pais [466], “George
was the more analytic one, better versed in theoretical physics, a greenhorn in
physics research and an aspiring historian... Sem [Goudsmit] was the detective,
thoroughly at home with spectra... known in the physics community and
a part-time assistant to Pieter Zeeman in Amsterdam.” Together they discovered
that while the normal Zeeman splitting came from the orbital angular
momentum of electrons, the anomalous Zeeman splitting must be due to an
additional property of the electron, the spin. Spin is an angular momentum,
originally thought to be due to a rotational degree of freedom of particles. 
However, since the electron has no size (as far as we know, see Chapter 9), it 
cannot be mechanically realised. Spin is an intrinsic property of the particle, 
like its charge and mass. Goudsmit and Uhlenbeck published their findings 
in a short note in 1925 [204], followed by a more substantial Nature paper in 
1926 [205]. The latter was followed by a very interesting postscript by Niels 
Bohr, where he pointed out that their conclusion, although still incomplete, 
“... must be more welcomed at the present time, when the prospect is held out
of a quantitative treatment of atomic problems by the new quantum mechan- 
ics initiated by the work of Heisenberg, which aims at the precise formulation 
of the correspondence between classical mechanics and quantum theory.” 

At first the interpretation as an effect of spin met difficulties because of 
the Landé factor [186] of 2 which relates the spin angular momentum to its 
magnetic moment. The factor was soon identified as a relativistic effect [215]. 
Wolfgang Pauli worked out the non-relativistic theory of spin, using Heisen- 
berg’s matrix quantum mechanics. He realised that the complicated numbers 
of electrons in closed shells can be reduced to the simple rule of one elec- 
tron per state, if the electron states are defined using four quantum numbers 
instead of the three ($n$, $l$, $m_l$) in the Bohr-Sommerfeld atom. For this purpose
he introduced a new two-valued quantum number, $m_s$, corresponding to spin
orientation [203]. His conclusion required the Pauli exclusion principle, stating
that no two electrons (more generally, no two identical fermions) can be in the same
quantum state. For example, if two electrons reside in the same orbital, then
their $n$, $l$ and $m_l$ values are the same, thus their spin projections must be
different, with $m_s = +1/2$ and $-1/2$. When Paul Dirac derived his relativistic
quantum mechanics in 1928, electron spin was an essential part of it. 
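
Pauli’s rule of one electron per state can be checked by simple counting. The sketch below is an illustration added here, not taken from Pauli’s paper: it enumerates all combinations of the four quantum numbers for a given principal quantum number $n$, assuming the usual hydrogen-like ranges $l = 0 \ldots n-1$ and $m_l = -l \ldots +l$. The function name `shell_states` is hypothetical.

```python
from fractions import Fraction

def shell_states(n):
    """List all (n, l, m_l, m_s) combinations for principal quantum number n."""
    states = []
    for l in range(n):                    # orbital quantum number l = 0 .. n-1
        for m_l in range(-l, l + 1):      # magnetic quantum number m_l = -l .. +l
            for m_s in (Fraction(-1, 2), Fraction(1, 2)):  # two spin orientations
                states.append((n, l, m_l, m_s))
    return states

# One electron per state gives the capacities 2 n^2 of the first four shells.
capacities = [len(shell_states(n)) for n in range(1, 5)]
print(capacities)  # [2, 8, 18, 32]
```

Without the two-valued $m_s$ the count would be half as large in every shell. The order in which real atoms fill these states is a separate question that this counting does not address.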


5.4 CANONICAL QUANTA 


The complex path from the quantum atom to a complete quantum mechanics 
involved many contributors. It has been detailed by a contemporary contributor,
Bartel L. van der Waerden, who reproduced and commented on the original articles
in reference [399]. There were several “schools” involved in the process, which
Victor F. Weisskopf described in the diagram reproduced in Figure 5.5. The
arrows indicate the intensity of scientific exchange between the centres; the
many multiple entries bear witness to the mobility of scientists in the aftermath of
the First World War. We do not need to go into the details of this development,
but will jump to the canonical formulations of quantum mechanics right
away.
Figure 5.5 Schools of theoretical physics in the years of the foundation of
quantum mechanics. Adapted from [441]. 


We have used plural here on purpose. There are indeed two such formula- 
tions, matrix mechanics by Werner Heisenberg, Max Born and Pascual Jordan; 
and wave mechanics by Louis de Broglie, Erwin Schrödinger and Peter Debye. 
Let us start with the matrix approach. 

One of the principal problems with quantum mechanics à la Bohr is that it
can describe stationary states but has trouble with time-dependent processes.
Thus the energy levels are predicted roughly right, absorbed and emitted radi- 
ation energies come out correctly, when using the Bohr prescription decoupling 
these from the energy of a single orbit. But how long does it take the elec- 
tron to fall back to a lower energy state and how does it decide which one to 
choose? In other words, what are the intensities of emission lines? The problem 
with time dependence can be traced back to assuming classical motion of the 
electrons. Hans Kramers made a first attempt to calculate transition probabilities
between quantum states in terms of a discrete-time Fourier transform
of the orbital motion [195, 196], ideas which were extended in collaboration 
with Werner Heisenberg to a semiclassical matrix-like description of atomic 
transition probabilities [201]. The standard narrative of quantum mechanics 
Classically, the emission and absorption frequencies of an oscillating system
are determined by its eigenfrequencies. This property is preserved in the
old quantum theory of the atom à la Bohr. However, the orbital angular
momentum is quantised, such that $mvr = nh/(2\pi)$ (see Focus Box 5.4). This
can be generalised to an orbital condition for momentum $p$ and position $q$:

$$\int_0^T p\,\frac{dq}{dt}\,dt = \int_{q(0)}^{q(T)} p\,dq = nh$$


In Kramers’ approach, emission and absorption are described via the time-discrete
Fourier transform of the orbital coordinate:

$$q(t) = \sum_{n=-\infty}^{\infty} Q_n\, e^{2\pi i n t/T}$$

The $Q_n$ are complex coefficients, with $Q_n = Q^*_{-n}$. According to the
correspondence principle, the classical and quantum frequency should be the same
for large $n$. The energy difference of close orbits $n$ and $m$, $(n - m) \ll (n, m)$,
should then be $(E_n - E_m) \simeq h(n - m)/T$, where $T$ is the classical period of the
orbit. Heisenberg replaced this explicit description of the orbital motion (which
is unobservable) by an equivalent description of the transition from $n$ to $m$,
which is observable via the radiated photon with frequency $(E_n - E_m)/h$. For
small $n$ and $m$, these are no longer multiples of any orbital frequency.
Heisenberg noticed that the Fourier transform $Q_{nm}$ of the transition amplitude has
only one frequency:

$$Q_{nm}(t) = e^{2\pi i (E_n - E_m)t/h}\, Q_{nm}(0)$$


This is an expression for the evolution in time of $Q_{nm}$; it can be read as an
equation of motion for the emission or absorption. The equation shows again
that energy is the vis viva of evolution. The quantities described follow matrix
algebra, such that a multiplication between e.g. position $Q$ and momentum $P$
matrix elements is defined as

$$(QP)_{nm} = \sum_{k=0}^{\infty} Q_{nk} P_{km}$$

The multiplication is not commutative:

$$\sum_k \left( Q_{nk} P_{km} - P_{nk} Q_{km} \right) = i\hbar\,\delta_{nm}$$


This relation gives rise to Heisenberg’s uncertainty principle (see Focus 
Box 5.8). 
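
The non-commutativity can be made concrete with small matrices. The following sketch is an illustration under stated assumptions, not Heisenberg’s original calculation: it uses the textbook position and momentum matrices of a harmonic oscillator in units where $\hbar = m = \omega = 1$, truncated to a finite dimension. The helper names `oscillator_matrices`, `matmul` and `commutator` are hypothetical.

```python
import math

HBAR = 1.0  # natural units, an assumption made for this illustration

def oscillator_matrices(dim):
    """Truncated position Q and momentum P matrices of a harmonic oscillator (m = omega = 1)."""
    Q = [[0j] * dim for _ in range(dim)]
    P = [[0j] * dim for _ in range(dim)]
    for n in range(dim - 1):
        amp = math.sqrt(HBAR * (n + 1) / 2)
        Q[n][n + 1] = Q[n + 1][n] = amp   # Q couples neighbouring states symmetrically
        P[n][n + 1] = -1j * amp           # P couples them with an imaginary, antisymmetric weight
        P[n + 1][n] = 1j * amp
    return Q, P

def matmul(A, B):
    """Matrix product (AB)_nm = sum_k A_nk B_km."""
    dim = len(A)
    return [[sum(A[n][k] * B[k][m] for k in range(dim)) for m in range(dim)]
            for n in range(dim)]

def commutator(A, B):
    """Return AB - BA."""
    AB, BA = matmul(A, B), matmul(B, A)
    dim = len(A)
    return [[AB[n][m] - BA[n][m] for m in range(dim)] for n in range(dim)]

DIM = 6
Q, P = oscillator_matrices(DIM)
C = commutator(Q, P)
for n in range(DIM):
    print(n, C[n][n])
```

All diagonal elements except the last come out as $i\hbar$ and all off-diagonal elements vanish. The last diagonal element deviates because the infinite matrices have been cut off at a finite dimension; the commutation relation can only hold exactly for infinite matrices.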


Focus Box 5.7: Matrix mechanics 
has it that Heisenberg had the basic idea of a solution during a forced stay on
the isolated island of Helgoland, where Max Born sent him in the spring of 1925
because of a bad bout of hay fever. Heisenberg reformulated all of quantum theory in
terms of a version of the transition matrices [200]. The principal ideas behind
Heisenberg’s radical approach are sketched in Focus Box 5.7. Their foundation 
is Heisenberg’s deeply rooted conviction that physics should only deal with 
observable quantities. By this we obviously denote quantities that are measur- 
able in principle, not necessarily in practice. The orbital motions of electrons 
in an atom are not in this category, thus Heisenberg disposed of the concept. 
Instead, he based matrix mechanics solely on transitions between quantum 
states. Since quantum states can be numbered using quantum numbers, the
transition from state $i$ to state $f$ thus involves a matrix element $M_{if}$, the
squared modulus of which is the probability for the transition to happen. All possible
matrix elements form a matrix, with index $i$ running from 1 to the number
of initial states, $f$ from 1 to the number of final states. Consecutive transitions
are described by multiplying matrix elements following matrix algebra,
as Max Born and Pascual Jordan noted [198, 202]. Our modern view of the 
matrices is that they are operators acting on a state vector. They describe 
observables. If the state is an eigenstate of the operator (corresponding to an 
eigenvector of the matrix), it will project out an eigenvalue which corresponds 
to the result of a measurement. Wolfgang Pauli applied the new method to 
the hydrogen atom in 1926 [211] and showed that it reproduced the correct 
spectrum. 

An immensely important result of matrix mechanics is Heisenberg’s uncer- 
tainty principle. It states that a quantum system can only be the simultaneous 
eigenstate of two operators if these commute. In other words, among a pair of 
observables described by non-commuting operators, only one can have a fixed 
value. Examples of such pairs of mutually exclusive observables are momentum
$p$ and position $r$, where the uncertainty principle states that $\Delta r\,\Delta p \geq \hbar/2$.
For energy $E$ and time $t$ it reads $\Delta E\,\Delta t \geq \hbar/2$. We go into details in Focus
Box 5.8. The principle was introduced by Heisenberg in 1927 [221], and
formally derived by Earl H. Kennard [222] and Hermann Weyl [229]. 

The uncertainty principle describes a fundamental property of quantum 
systems, not a limitation of measurement. It has often been confused with 
an interaction between a macroscopic measurement device and the quantum 
system itself. That would blame its existence on the interference of measure- 
ment and quantum object called the “observer effect”. This is partially due to 
Heisenberg himself, who used the latter as an example of a physical cause of 
the principle [237]. However, it is much more important than this. A particle 
at a defined space point does not have a defined momentum. A particle of 
fixed momentum is everywhere. 

In parallel to the impressively fast progress with matrix mechanics, an 
alternative was developed, which did not give up on the concept of describing 
the electron’s motion. In 1924, Louis de Broglie [199] introduced the idea that 
the relation between energy-momentum on one side and frequency-wavevector 
We look into Heisenberg’s uncertainty principle using position and momentum
as examples. Their respective operators $Q$ and $P$ do not commute, i.e.
$QP - PQ = i\hbar$. Let us assume that the particle is in an eigenstate $\psi$ of the position
operator $Q$ for position $q$:

$$Q\psi = q\psi$$

Applying the commutator above we find:

$$(QP - PQ)\psi = (Q - qI)P\psi = i\hbar\psi$$

where $I$ is the identity operator, $I\psi = \psi$. Let us assume that the particle
could simultaneously be in an eigenstate of $P$ with eigenvalue $p$. We would
then have:

$$(Q - qI)P\psi = (Q - qI)p\psi = (qI - qI)p\psi = 0$$

in contradiction to the commutation relation above. This means that the
particle cannot have a fixed position and a fixed momentum at the same
time. Denoting by $\Delta p$ and $\Delta q$ the momentum and position range the particle
has, one finds Heisenberg’s uncertainty relation for position and momentum,
$\Delta q\,\Delta p \geq \hbar/2$, or energy and time, $\Delta E\,\Delta t \geq \hbar/2$. Since
$h = 2\pi\hbar = 6.62607004 \times 10^{-34}$ J s is a very tiny action indeed, the uncertainty only manifests itself in
quantum systems. You cannot use it as an excuse when caught speeding in
your car. But it does manifest itself everywhere at the quantum scale.
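
To see how tiny the effect is for a car and how large for an electron, one can put numbers into the relation for position and momentum. The sketch below is an added illustration; the confinement lengths and the car mass are assumed values chosen for the example, and `min_velocity_spread` is a hypothetical helper name.

```python
HBAR = 1.054571817e-34            # reduced Planck constant in J s
ELECTRON_MASS = 9.1093837015e-31  # electron mass in kg

def min_velocity_spread(mass_kg, position_spread_m):
    """Smallest velocity spread compatible with dq * dp >= hbar / 2."""
    return HBAR / (2.0 * position_spread_m * mass_kg)

# Assumed example 1: an electron confined to an atomic size of 0.1 nm
electron_spread = min_velocity_spread(ELECTRON_MASS, 1e-10)

# Assumed example 2: a car of 1000 kg whose position is known to 1 mm
car_spread = min_velocity_spread(1000.0, 1e-3)

print(f"electron: {electron_spread:.2e} m/s")
print(f"car:      {car_spread:.2e} m/s")
```

The electron’s velocity is uncertain by several hundred kilometres per second, while the corresponding figure for the car is dozens of orders of magnitude below anything a radar trap could resolve.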


Focus Box 5.8: Heisenberg’s uncertainty principle 
on the other side is valid for all particles, not only photons. He assigned a
wavelength of $\lambda = h/p$ to the electron, or in fact any matter particle. The de
Broglie wavelength is thus the ratio of Planck’s constant to the momentum of 
a particle. By describing the electron motion as a standing matter wave, he 
reproduced Bohr’s quantisation condition. The idea was strongly supported 
by Einstein, who thought that Heisenberg’s matrix mechanics was too formal. 
In 1927, Clinton Davisson and Lester Germer demonstrated the wave nature 
of electrons by diffraction [219, 224]. Peter Debye commented that if electrons 
behave as waves, their movement should follow a wave equation. 
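
A quick calculation shows why crystals are suitable diffraction gratings for slow electrons. The sketch below is an added illustration using the non-relativistic relation $p = \sqrt{2mE}$; the beam energy of 54 eV is an assumed example value for a low-energy electron beam and is not quoted from the text, and `de_broglie_wavelength` is a hypothetical helper name.

```python
import math

PLANCK_H = 6.62607015e-34           # Planck constant in J s
ELECTRON_MASS = 9.1093837015e-31    # electron mass in kg
ELEMENTARY_CHARGE = 1.602176634e-19 # conversion factor from eV to J

def de_broglie_wavelength(kinetic_energy_ev, mass_kg=ELECTRON_MASS):
    """Non-relativistic de Broglie wavelength lambda = h / p with p = sqrt(2 m E)."""
    energy_joule = kinetic_energy_ev * ELEMENTARY_CHARGE
    momentum = math.sqrt(2.0 * mass_kg * energy_joule)
    return PLANCK_H / momentum

wavelength_nm = de_broglie_wavelength(54.0) * 1e9
print(f"{wavelength_nm:.3f} nm")
```

The result is a fraction of a nanometre, comparable to the spacing of atoms in a crystal lattice, which is why a crystal surface diffracts such electrons.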

Soon after, Erwin Schrödinger found that equation [213, 214], a differential 
equation for the evolution of a wave function $\psi(\vec{r}, t)$ in three dimensions.
He found it by analogy with wave and geometrical optics⁷. We sketch
the properties of the wave function and the Schrödinger equation in Focus 
Box 5.9. In 1926, Schrödinger solved the problem of an electron’s motion in the 
potential well created by the nucleus [212] and found the correct energy levels. 
Wave mechanics works, at least when particles are moving non-relativistically. 

Both approaches, matrix mechanics and wave mechanics, have been shown 
to be completely equivalent by Paul Dirac in 1926 [208]. The reason why I 
prefer the wave formulation is that it can be applied to interactions between 
particles, which has always been the subject of my own research. This widen- 
ing of the scope of quantum mechanics was initiated by Max Born, who had 
become a professor at the University of Göttingen in 1921. He attracted many brilliant
people to this small provincial town in northern Germany. The first was
his friend and colleague, the experimentalist James Franck. His PhD students 
included Siegfried Flügge, Friedrich Hund, Pascual Jordan, Maria Goeppert- 
Mayer, Robert Oppenheimer and Victor Weisskopf. Among his many notable 
assistants, Enrico Fermi, Werner Heisenberg, Wolfgang Pauli, Léon Rosenfeld, 
Edward Teller and Eugene Wigner stand out. In 1926, Born wrote two articles
about the quantum mechanics of scattering processes in Zeitschrift für Physik,
an introductory one [207] with little mathematics by his standards, followed 
by a more elaborate one [206]. In these papers he developed the statistical 
interpretation of the wave function which we summarise in Focus Box 5.9. In 
short, the squared modulus of the wave function $\psi(\vec{r}, t)$ is the probability per unit volume
to find the particle at position $\vec{r}$ at time $t$. In analogy to electromagnetic
waves, where the intensity is the square of the amplitude, the wave function 
itself can thus be called a probability amplitude. Scattering modifies the wave 
function according to Huygens’ principle (see Focus Box 7.5). In a particularly 
enlightening presentation of his ideas in Nature [217], carefully translated into 
English by Oppenheimer, Born states: “We free forces of their classical duty of 
determining directly the motion of particles and allow them instead to deter- 
mine the probability of states.” We will come back to all of this in more detail 


⁷ An enlightening derivation of Schrödinger’s equation by Richard Feynman is explained
in [483].
• The abstract state function $\Psi$ contains all information about a physical state
which can be obtained in principle. It is the equivalent of the state vector of
matrix mechanics. In coordinates, it is represented by a complex wave function
$\psi(q_i, s_i, t)$, depending on the classical degrees of freedom $q_i$ (like position and
momentum) and the quantum degrees of freedom $s_i$ (like spin), as well as
time $t$. The wave function is not observable for a single particle (therefore
Heisenberg did not like it).

• A physical observable of the system, i.e. a property which can in principle
be measured, is described by a linear hermitian operator $\Omega$. Examples are the
momentum operator $p_i \to -i\hbar\,\partial/\partial q_i$ and the energy operator $E \to i\hbar\,\partial/\partial t$.

• A system is in an eigenstate $\Psi_n$ of the operator $\Omega$ if $\Omega\Psi_n = \omega_n\Psi_n$, i.e.
if the operator projects out an eigenvalue $\omega_n$ of the observable. These are
quantised, so they can be numbered. For hermitian $\Omega$, the eigenvalues $\omega_n$ are
real numbers. In coordinates one has $\Omega(q, s, t)\,\psi_n(q, s, t) = \omega_n\psi_n(q, s, t)$. The
eigenstates form an orthonormal basis, thus every possible state can be written
as a weighted sum of eigenstates, $\Psi = \sum_n a_n\Psi_n$, with complex coefficients
$a_n$. The squared moduli of these coefficients are the probability densities to find the
system in eigenstate $n$ at time $t$, their space integrals are the corresponding
probabilities. This statement is called the Born rule, the “fundamental interpretive
postulate of quantum mechanics” according to Steven Weinberg [628,
p. 29]. In particular, for a pure eigenstate $n$, $\rho = \psi_n^*\psi_n = |\psi_n|^2 \geq 0$ is the
probability density to find it at $t$ with properties $q$ and $s$.

• Because of the orthonormality, one has $\sum_s \int (dq_1 \ldots)\,\psi_n^*(q_i, s, t)\,\psi_m(q_i, s, t) = \delta_{nm}$.
A measurement of an observable delivers the $n$th eigenvalue of $\Omega$ with
probability $|a_n|^2$. Its expectation value, i.e. the average weighted by probability,
is thus $\langle\Omega\rangle = \sum_s \int (dq_1 \ldots)\,\psi^*(q_i, s, t)\,\Omega\,\psi(q_i, s, t) = \sum_n |a_n|^2\,\omega_n$.

• The time evolution of a system is described by the Schrödinger equation,
$i\hbar\,\partial\psi/\partial t = H\psi$, with the linear hermitian Hamilton operator $H$; the corresponding
observable is energy. In a closed system, $H$ does not explicitly depend on
time, $\partial H/\partial t = 0$. The eigenstates of $H$ are then stationary, the eigenvalues
are conserved. Because of hermiticity, one has

$$\frac{d}{dt}\sum_s \int (dq_1 \ldots)\,\psi^*\psi = \frac{i}{\hbar}\sum_s \int (dq_1 \ldots)\left[(H\psi)^*\psi - \psi^*(H\psi)\right] = 0$$

Probability is thus a conserved quantity, $\frac{d}{dt}\int \rho\,dx = \frac{d}{dt}\int (dq_1 \ldots)\,\psi^*\psi = 0$.

• The simplest solution to Schrödinger’s equation is a plane wave with amplitude
$\psi_0$, propagating in empty space in the direction $\vec{k}$ with angular frequency $\omega = 2\pi\nu$
and wave number $k = 2\pi/\lambda$: $\psi(\vec{r}, t) = \psi_0\, e^{i(\vec{k}\cdot\vec{r} - \omega t)}$.
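
The Born rule and the expectation value can be illustrated with a system that has only two eigenstates. The sketch below is an added example, not part of the summary above: the amplitudes and eigenvalues are hypothetical numbers chosen for illustration, and the function names are invented for this purpose.

```python
import math

def normalise(coefficients):
    """Scale complex coefficients a_n so that sum |a_n|^2 = 1."""
    norm = math.sqrt(sum(abs(a) ** 2 for a in coefficients))
    return [a / norm for a in coefficients]

def born_probabilities(coefficients):
    """Probability |a_n|^2 of obtaining the n-th eigenvalue in a measurement."""
    return [abs(a) ** 2 for a in coefficients]

def expectation_value(coefficients, eigenvalues):
    """Average of the eigenvalues weighted by the Born probabilities."""
    return sum(p * w for p, w in zip(born_probabilities(coefficients), eigenvalues))

# Hypothetical two-level observable with eigenvalues -1 and +1
eigenvalues = [-1.0, 1.0]
amplitudes = normalise([1 + 1j, 2.0])   # arbitrary complex amplitudes before normalisation

probabilities = born_probabilities(amplitudes)
mean = expectation_value(amplitudes, eigenvalues)
print(probabilities, mean)
```

For these amplitudes the two outcomes occur with probabilities 1/3 and 2/3, so the expectation value is 1/3 although no single measurement ever returns that number. The complex phases of the coefficients drop out of the probabilities; they only matter when states are superposed and interfere.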


Focus Box 5.9: A summary of non-relativistic wave quantum mechanics 
following Bjorken and Drell [379] 
in Chapter 7. Born received the 1954 Nobel prize in physics for the statistical
interpretation of quantum mechanics. 

I personally agree with Einstein and Born that wave mechanics is prefer- 
able to matrix mechanics when thinking about quanta. I like the idea that 
particles are point-like objects which move like waves. The wave aspect just 
means to me that one must abandon the idea of a trajectory, a smooth curve 
in space-time which locates the particle. Instead the probability amplitude 
defines a distribution of positions in space and time, a field, where the particle 
can manifest itself when measured. A decisive conceptual difference between 
trajectories and waves is that the latter can interfere with each other, the 
former cannot. We discuss interference phenomena in Focus Box 5.10 using 
light as a familiar example. 


5.5 QUANTUM NUCLEI 


Some twenty years after the discovery of the atomic nucleus, most physicists 
believed that it was a bound state of protons and electrons. The number of pro- 
tons would account for nuclear weight, the difference between the number of 
protons and electrons for the nuclear charge, which gives the atomic numbering 
scheme of the periodic table and determines the chemical properties of the elements. There
are a couple of obvious problems with this idea. First, it is entirely unclear why
a certain group of electrons would choose to be bound in the compact nucleus,
while an equally large second group would fill the much larger atomic volume.
An electromagnetic binding force between electrons and protons could not
provide such a feature. But there also was corroborating evidence. Nuclear
beta decay emitted electrons, so shouldn’t they be present in the nucleus? In 
1919, Rutherford reported experiments which showed that bombarding nitro- 
gen nuclei with energetic alpha particles kicked out protons [179]. We can take 
the grandmaster himself as a witness of public opinion. In 1920 he wrote in 
his Bakerian lectures [185]: “We also have strong reason for believing that 
the nuclei of atoms contain electrons as well as positively charged bodies, and 
that the positive charge on the nucleus represents the excess positive charge. 
It is of interest to note the very different role played by the electrons in the 
outer and inner atom. In the former case, the electrons arrange themselves 
at a distance from the nucleus, controlled no doubt mainly by the charge on 
the nucleus and the interaction of their own fields. In the case of the nucleus, 
the electron forms a very close and powerful combination with the positively 
charged units and, as far as we know, there is a region just outside the nucleus 
where no electron is in stable equilibrium. While no doubt each of the exter- 
nal electrons acts as a point charge in considering the forces between it and 
the nucleus, this cannot be the case for the electron in the nucleus itself.” He 
speculated about the existence of a neutral nuclear bound state of a proton 
with an electron, which would be able to penetrate matter and even atoms. 
But in the same period of time, more doubts appeared. Franco Rasetti had 
measured the spectra of diatomic molecules like O₂ and N₂, and found that
Diffraction of a plane wave of light occurs when it meets an obstacle. On the left, we consider a
single slit of width $w$ at a distance $D$ from a screen. When the light with wavelength $\lambda$ meets
the slit, according to Huygens’ principle, every point in the aperture emits a cylindrical wavelet.
The amplitudes of all these, originally in phase, add up to form the outgoing wave. We discuss
diffraction of the wave in the Fraunhofer limit, i.e. for $w^2/(D\lambda) \ll 1$. What arrives at the screen
at normal incidence, $\theta = 0$, is a bright strip with a fuzzy edge.

[Sketch: incoming wave, single slit of width w, screen]

At any other angle, when we compare the phase of the wavelet emitted
from the middle of the slit to that emitted at the lower edge, we
find a phase shift of $\delta = \pi w \sin\theta/\lambda$. All pairs of wavelets coming from points
a distance $w/2$ apart have that phase shift. For $w\sin\theta = \lambda$ they cancel pairwise,
which gives the first minimum. Further minima follow at regular intervals,
$w\sin\theta = m\lambda$ with a non-zero integer $m$, separated by secondary maxima close to
$w\sin\theta = (m + 1/2)\lambda$. The displacement of the minima on the screen is
$y_m = D\tan\theta \simeq D\sin\theta = mD\lambda/w$.
The intensity distribution is $I(y) = I_0 \sin^2\delta/\delta^2$ with $\delta \simeq \pi w y/(D\lambda)$.
The diffraction pattern is shown in the upper figure on the bottom left.


When there is a double slit, with negligible width, we can treat both à la Huygens as point sources
of in-phase monochromatic light. The geometry is shown on the right. The path difference between
two rays is $d\sin\theta$ for a slit separation $d$, such that maxima occur at
$y_m = mD\lambda/d$, all with equal intensity, as shown in the middle figure below.

[Sketch: incoming wave, double slit, screen]

When the width of the slits is not negligible, we find an overall intensity distribution characterised by the
slit width, but with regular strips, as shown in the lower figure. In the sketches above we have shown
the wave character of the light as wavefronts, the particle character as straight rays. Indeed both aspects
are necessary to understand the phenomena of diffraction and interference. Classical interference
experiments with visible light were performed by Thomas Young in 1801 [7].

[Plots: intensity patterns for the single slit (upper), the ideal double slit (middle) and the double slit with finite slit width (lower)]
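
The three intensity patterns can be reproduced numerically. The sketch below is an added illustration in the Fraunhofer limit: the single-slit envelope is the formula given above, the $\cos^2$ law for two ideal slits is the standard result consistent with maxima at $y_m = mD\lambda/d$, and their product describes slits of finite width. The wavelength and geometry are assumed example values and the function names are hypothetical.

```python
import math

def single_slit(y, w, D, lam):
    """Single-slit intensity relative to the central maximum: sin^2(delta) / delta^2."""
    delta = math.pi * w * y / (D * lam)
    return 1.0 if delta == 0.0 else (math.sin(delta) / delta) ** 2

def double_slit(y, d, D, lam):
    """Two ideal slits a distance d apart: equally bright fringes following cos^2."""
    return math.cos(math.pi * d * y / (D * lam)) ** 2

def double_slit_finite_width(y, w, d, D, lam):
    """Fringes of the double slit modulated by the single-slit envelope."""
    return single_slit(y, w, D, lam) * double_slit(y, d, D, lam)

# Assumed example setup: 600 nm light, screen at 1 m, 20 micron slits, 100 micron apart
lam, D, w, d = 600e-9, 1.0, 20e-6, 100e-6

first_minimum = D * lam / w    # first zero of the single-slit envelope
fringe_spacing = D * lam / d   # distance between neighbouring double-slit maxima

for step in range(0, 11):
    y = step * fringe_spacing / 2
    print(f"y = {y * 1e3:5.1f} mm  I = {double_slit_finite_width(y, w, d, D, lam):.3f}")
```

With these numbers the Fraunhofer condition $w^2/(D\lambda) \ll 1$ is comfortably fulfilled. The printed values alternate between bright and dark fringes while the envelope slowly decreases towards its first zero at $y = D\lambda/w$.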


Focus Box 5.10: Diffraction and interference 
both nuclei have integer spins. For oxygen, this fits with a nucleus made of 16
protons and 8 electrons. But for nitrogen, there ought to be 14 protons and
7 electrons, thus half-integer spin. The plot thickened when Walther Bothe
and Herbert Becker observed a penetrating neutral radiation when beryllium
nuclei were bombarded with high-energy α particles [235]. They conjectured
that the radiation was of electromagnetic origin. That became unlikely when
Irène and Frédéric Joliot-Curie observed very high-energy protons when exposing
hydrogen-rich substances to Bothe’s radiation [247], which would be in con- 
tradiction to their own absorption measurements if the radiation were indeed 
made of photons. 
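
The spin argument against nuclear electrons is a matter of counting spin-1/2 particles: an even number can only combine to an integer total spin, an odd number only to a half-integer one, and orbital angular momentum, being an integer, does not change this. The sketch below is an added illustration; it compares the proton-electron picture with a nucleus built from protons and neutrons, anticipating the neutron introduced in the following paragraph. The function names are hypothetical.

```python
def possible_spin_type(n_fermions):
    """Total spin of n spin-1/2 particles: integer for even n, half-integer for odd n."""
    return "integer" if n_fermions % 2 == 0 else "half-integer"

def proton_electron_model(mass_number, charge_number):
    """Nucleus made of A protons and (A - Z) electrons."""
    return possible_spin_type(mass_number + (mass_number - charge_number))

def proton_neutron_model(mass_number, charge_number):
    """Nucleus made of Z protons and (A - Z) neutrons, i.e. A nucleons in total."""
    return possible_spin_type(mass_number)

for name, A, Z in [("oxygen-16", 16, 8), ("nitrogen-14", 14, 7)]:
    print(name, proton_electron_model(A, Z), proton_neutron_model(A, Z))
```

Both pictures agree for oxygen, but only the proton-neutron picture gives the integer spin observed for nitrogen.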

The particle behind the new radiation, the neutron, was found by James 
Chadwick in 1932 [244]. He was a student and close collaborator of Rutherford 
and his deputy in directing the Cavendish Laboratory. To determine
the mass of the new neutral particle, he measured the recoil energy of
light nuclei like hydrogen and nitrogen when the neutrons bounced off them. The result was that
the neutron mass was not at all zero but roughly equal to the proton mass. The
numbering scheme of Mendeleev, with atomic number equal to the number 
of protons in a nucleus, was completed by the fact that the nuclear mass was 
roughly proportional to the number of protons plus neutrons⁸. Nuclear masses
were found to be roughly integer numbers, when expressed in the conventional 
mass units of one sixteenth of the oxygen mass. A notable exception was 
the hydrogen nucleus, with 1.008 mass units. The masses of many elements 
and their isotopes (nuclei with the same number of protons and a different
number of neutrons) were measured with better than per-mille precision by
the British physicist and chemist Francis William Aston, with his novel mass
spectrograph [177, 182]. The mass unit, today called the atomic mass unit and defined
as one twelfth of the mass of the carbon isotope ¹²C, is roughly the average of the
proton and neutron mass. When subtracting the closest whole number from 
his measured masses for each isotope, Aston found that there was a significant 
deviation, a mass defect. Dividing by the mass number, he obtained what he 
called “packing fraction”. It is in fact roughly the binding energy per nucleon. 
In a Bakerian lecture in 1927 [216], he gave a list of many mass measurements 
with elements and isotopes up to a mass number of 200. When plotting packing 
fractions as a function of mass number, he found that most of them lay on 
a smooth curve. However, there were notable exceptions: nuclei like helium, 
carbon and oxygen have a much larger mass defect. Aston received the Nobel 
prize in chemistry in 1922 for the discovery of isotopes and the rule of whole 
numbers. 
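
The link between mass defect and binding energy can be made explicit with modern mass values. The sketch below is an added illustration and does not use Aston’s numbers: the atomic masses and constants are standard tabulated values in atomic mass units, the small binding energy of the atomic electrons is neglected, and the function name is hypothetical.

```python
U_IN_MEV = 931.494          # energy equivalent of one atomic mass unit in MeV
PROTON_MASS = 1.007276467   # in u
NEUTRON_MASS = 1.008664916  # in u
ELECTRON_MASS = 0.000548580 # in u

# Atomic masses in u (standard tabulated values), with charge number Z and mass number A
ATOMIC_MASSES = {
    "He-4": (2, 4, 4.002603254),
    "C-12": (6, 12, 12.0),
    "O-16": (8, 16, 15.994914620),
}

def binding_energy_per_nucleon(Z, A, atomic_mass):
    """Mass of the free nucleons minus the nuclear mass, converted to MeV per nucleon."""
    nuclear_mass = atomic_mass - Z * ELECTRON_MASS   # strip the atomic electrons
    mass_defect = Z * PROTON_MASS + (A - Z) * NEUTRON_MASS - nuclear_mass
    return mass_defect * U_IN_MEV / A

results = {name: binding_energy_per_nucleon(Z, A, m) for name, (Z, A, m) in ATOMIC_MASSES.items()}
for name, value in results.items():
    print(f"{name}: {value:.2f} MeV per nucleon")
```

The three nuclei come out between 7 and 8 MeV per nucleon, i.e. a little less than one percent of the nucleon mass is missing, which is the order of magnitude of the deviations from whole numbers that Aston measured.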

These measurements inspired Georgiy Antonovich “George” Gamow, a 
physicist of Soviet origin born in Odessa. He had studied at the University of 
Leningrad, together with Dmitri Ivanenko and Lev Landau. Frustrated by his 
thesis work, he went to Göttingen to work with Max Born and Fritz Houtermans.
Léon Rosenfeld [499] remembered him as “a Slav giant, fair-haired and


8For a more detailed history of nuclear mass measurements see [557]. 
speaking a very picturesque German; in fact he was picturesque in everything,
even his physics.” Gamow spent a year in Copenhagen and visited Rutherford’s
laboratory in 1929. In a discussion session at the Royal Society [236],
he proposed a “simple model of the nucleus built from α-particles in a way
very similar to a water-drop held together by surface tension.” This liquid
drop model was the first attempt to formulate a model of the nucleus compatible
with quantum theory. While Gamow returned to the Soviet Union
in 1931, the idea was quickly picked up by Heisenberg [249, 250, 255, 453].
He did not take α-particles but protons and neutrons as nuclear constituents,
thus recognising the neutron as an elementary particle. In 1932 he wrote [249]: 
“If one wanted to take the neutron as composed of proton and electron, one 
would have to attribute Bose statistics and spin zero to the electron. It does 
not appear useful to elaborate such a picture any further. Rather the neutron 
shall be recognised as a fundamental constituent, which however is assumed to 
decay into proton and electron under suitable conditions, where conservation 
laws of energy and momentum may no longer be applicable.” These “suitable 
conditions” were elaborated shortly afterwards by Enrico Fermi in his theory 
of β-decay, and energy-momentum conservation restored (see Section 9.2).
The nature of the neutron as an elementary particle was further corroborated
by Chadwick’s and Maurice Goldhaber’s observation of the splitting of the deuteron
induced by gamma rays [258].

A long-lasting version of the liquid drop model is the semi-empirical implementation
due to Carl Friedrich von Weizsäcker, German physicist, philosopher
and brother of the later president of the Federal Republic of Germany,
Richard von Weizsäcker. In 1936, he published the model [266] we explain in 
Focus Box 5.11. Scattering experiments demonstrated that the nuclear force 
was not only much stronger than the electromagnetic one, but also the same 
for protons and neutrons [264, 265], confirming one of the basic assumptions of 
the liquid drop model. Weizsäcker’s work was popularised in the monumental,
almost 500-page review of nuclear physics by Hans Bethe and collaborators
in 1936/7 [270, 274, 276], which summarised the status of the field. 

The dynamics of nuclei when bombarded by neutrons and alpha particles 
was a subject of intense experimental and theoretical activity at the time. Ten 
years earlier, Rutherford had invented a satellite model where alpha particles 
orbited a core nucleus so that they could escape and give rise to a decay [223]. 
Gamow [225] as well as Ronald Wilfred Gurney and Edward U. Condon [227]
had clarified why α particles are able to leave the nuclear potential by quantum
tunnelling, but cannot easily enter it. Fermi and his group in Rome
had bombarded uranium with neutrons and observed the creation of radioactive
elements they thought to be trans-uranium nuclei [259]; their work laid
the groundwork for the discovery of nuclear fission.

The father of nuclear chemistry, Otto Hahn, received his education at the 
University of Marburg, Germany, and originally intended a career in industry. 
To improve his language skills, he went abroad and worked with William 
Ramsay at University College London and Rutherford in Montreal. Returning 
to Berlin to pursue research in radioactivity, he met the Austrian physicist 
Figure 5.6 Physicist Lise Meitner and chemist Otto Hahn in their laboratory.
(Credit: imago images/leemage) 


Lise Meitner [678], who became his lifelong collaborator and friend. A photo of
both in their Berlin laboratory is shown in Fig. 5.6. Meitner had obtained her
doctorate in physics from the University of Vienna in 1905, only the second
woman to receive a doctorate in the sciences at that university. In 1911, when Fritz Haber
became the director of the physical chemistry institute of the newly created 
Kaiser Wilhelm Gesellschaft (today Max Planck Gesellschaft), he appointed 
Hahn as the head of its radioactivity institute. From 1918 onward, Meitner 
headed the institute’s physics department until she was forced to emigrate 
in 1938. To follow up on the work of the Fermi group and similar results
by Irène Joliot-Curie and Pavle Savić in Paris 5], Hahn and his fellow
radiochemist Friedrich Wilhelm “Fritz” Strassmann tried to isolate the final
state components of the observed reactions of slow neutrons with uranium
nuclei. On December 17, 1938, they succeeded in showing that a much
lighter element, the alkaline earth metal barium, was one of the resulting nuclei.
Nuclear fission induced by slow neutrons had been discovered. That this
was indeed the mechanism for this very exoenergetic nuclear reaction was first 
realised by Meitner and her nephew Otto Robert Frisch [288, 284], who were 
by then working in Stockholm and Copenhagen, respectively. While spending 
Christmas 1938 together, they used the liquid drop model to explain how 
the absorption of a neutron could deform the drop such that it would split 
into two smaller drops (see ). This happens when the surface 
tension makes this final state energetically favoured, as can be seen from Focus
Box 5.11. When Niels Bohr learned about this, he was very excited [394]: “Oh,
what fools we have been! We ought to have seen that before.” He notified the 
New Yorker John A. Wheeler in early January 1939 [394] and together they 
formulated a quantitative theory [282], again using the liquid drop model. In 
particular, they calculated the neutron energy required to induce fission in 
various nuclei. 


Figure 5.7 Average binding energy per nucleon (in MeV) as a function of the mass number
A. The most tightly bound nuclei are found in the iron group and in the
so-called magic nuclei. (Credit: Wikimedia Commons) 


What the smooth curve of the liquid drop model completely misses are the 
“magic” nuclei, as Eugene Wigner called them [557], the ones with a number 
of neutrons or protons equal to 2, 8, 20, 28, 50, 82 and 126. For protons, this 
corresponds to He, O, Ca, Ni, Sn and Pb. Even more stable are the doubly 
magic isotopes, ⁴He, ¹⁶O, ⁴⁰Ca, ⁴⁸Ca, ⁴⁸Ni, ⁵⁶Ni, ¹⁰⁰Sn, ¹³²Sn and ²⁰⁸Pb. An
approach closer to quantum physics, describing the nuclear structure in
analogy to the atom, is able to explain these exceptions. An early attempt to 
construct such a shell model of the nucleus was published in 1932 by Dmitri Ivanenko
[248], another member of the three “Leningrad musketeers” together
with Lev Landau and George Gamow. Such an approach seems natural, but 
there are important differences to the physics of the atom. First of all, while the 
electromagnetic potential is well known since Maxwell, the nuclear potential 
is not. It is not created by a fixed outside source like in the atom, but for each 
nucleon by all the others present in the same nucleus. A quantitative formu- 
lation using a square-well potential was developed by Maria Goeppert-Mayer 

The liquid drop model, invented by Gamow and turned into a semi-empirical 
model with adjustable parameters by von Weizsäcker, was the first nuclear 
model to describe the binding energy $E_B$ and the radius $R$ of nuclei. The latter 
is given by a compact packing of $A$ incompressible nucleons, such that 
$R = r_0 A^{1/3}$ and the nuclear volume is roughly proportional to $A$, the atomic mass 
number. The radius parameter is $r_0 \simeq 1.25$ fm. The (negative) binding energy, 
$E_B = M(Z,A) - Z m_p - (A - Z) m_n$, is described by the Bethe-Weizsäcker 
formula: 

$$E_B = \underbrace{-a_1 A}_{\text{Volume}} + \underbrace{a_2 A^{2/3}}_{\text{Surface}} + \underbrace{a_3 \frac{Z(Z-1)}{A^{1/3}}}_{\text{Coulomb}} + \underbrace{a_4 \frac{(A-2Z)^2}{A}}_{\text{Asymmetry}} \underbrace{\pm\, a_5 A^{-1/2}}_{\text{Pairing}}$$


It is roughly proportional to the number of nucleons, $-a_1 A$, thus the first term. 
However, there is a core where the nuclear force is saturated, surrounded by 
a surface layer where it is not. The latter diminishes the binding energy by 
$+a_2 A^{2/3}$. This surface term is more important for light than for heavy nuclei. 
Coulomb repulsion between protons is taken into account by a term 
proportional to the number of proton-proton pairs $Z(Z-1)$ with the atomic number 
$Z$. Up to here, the considerations are purely classical. In particular, they do 
not account for the fact that light nuclei with an equal number of protons, $Z$, 
and neutrons, $N = A - Z$, are particularly stable; for others, the asymmetry term 
makes the binding energy less negative. In addition, there are many stable 
nuclei with both proton and neutron numbers even and few with both odd; 
this is described by the pairing term. All parameters $a_i$ are positive. The 
asymmetry term is positive and reduces the binding energy when $N \neq Z$. 
The pairing term has a positive sign for odd $N$ and $Z$, negative when both 
are even, and zero otherwise. The coefficients are determined by adjusting to 
measured nuclear masses. A recent result from 2228 experimentally measured 
atomic masses is, in MeV [558]: 


$a_1 = 15.777 \pm 0.037$ ; $a_2 = 18.341 \pm 0.133$ ; $a_3 = 0.710 \pm 0.002$ 
$a_4 = 23.211 \pm 0.060$ ; $a_5 = 11.996 \pm 1.536$ 


The formula gives rather good results for heavy nuclei, less so for light ones. 
The exceptionally stable “magic” nuclei are badly represented (see Figure 5.7). 
The formula is especially useful to understand nuclear fission and fusion. 
Fission splits a heavy nucleus ($A \gg 56$) into an asymmetric pair of lighter ones, 
towards more tightly bound nuclei (see Focus Box 6.1). Fusion unites two light 
nuclei ($A < 56$) to form a more tightly bound heavier one (see Focus Box 6.4). 
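
To get a feeling for the numbers, the formula can be evaluated directly. The following is a minimal sketch, not a definitive implementation: it uses only the central values of the coefficients quoted above, assumes they are in MeV, and assumes a pairing term proportional to $A^{-1/2}$. The function names and the symmetric-split estimate are illustrative choices; real fission is asymmetric and emits neutrons, which is ignored here.

```python
import math

# Central values of the fitted coefficients quoted in the box (assumed to be in MeV).
A_VOLUME, A_SURFACE, A_COULOMB, A_ASYMMETRY, A_PAIRING = 15.777, 18.341, 0.710, 23.211, 11.996


def binding_energy(Z, A):
    """Return the (negative) liquid-drop binding energy in MeV for Z protons and A nucleons."""
    N = A - Z
    volume = -A_VOLUME * A
    surface = A_SURFACE * A ** (2.0 / 3.0)
    coulomb = A_COULOMB * Z * (Z - 1) / A ** (1.0 / 3.0)
    asymmetry = A_ASYMMETRY * (A - 2 * Z) ** 2 / A
    # Pairing: positive for odd-odd nuclei, negative for even-even, zero otherwise.
    if Z % 2 == 1 and N % 2 == 1:
        pairing_sign = 1
    elif Z % 2 == 0 and N % 2 == 0:
        pairing_sign = -1
    else:
        pairing_sign = 0
    pairing = pairing_sign * A_PAIRING / math.sqrt(A)  # assumed A^(-1/2) dependence
    return volume + surface + coulomb + asymmetry + pairing


def symmetric_fission_release(Z, A):
    """Energy in MeV released if an even-Z, even-A nucleus split into two equal halves.

    Nucleon masses cancel because Z and N are conserved, so only binding energies enter.
    """
    return binding_energy(Z, A) - 2 * binding_energy(Z // 2, A // 2)


if __name__ == "__main__":
    for name, Z, A in [("Fe-56", 26, 56), ("U-238", 92, 238)]:
        print(f"{name}: E_B/A = {binding_energy(Z, A) / A:.2f} MeV")
    print(f"U-236 -> 2 x Pd-118: {symmetric_fission_release(92, 236):.0f} MeV released")
```

With these inputs the sketch gives about -8.7 MeV per nucleon for iron and about -7.6 MeV per nucleon for uranium, and a release of roughly 180 MeV for the hypothetical symmetric split, which illustrates why splitting heavy nuclei moves towards more tightly bound ones.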


Focus Box 5.11: Liquid drop model of the atomic nucleus 

during her time in Chicago and at the Argonne National Laboratory in the 
1940s, and published in 1949 [314, 317, 318]. At the same time, Otto Haxel, 
J. Hans D. Jensen and Hans Suess devised an equivalent model [315]. Their 
subsequent trans-Atlantic collaboration resulted in a comprehensive book on 
the subject [329]. The basics of the nuclear shell model are explained in Focus 
Box 5.12. Goeppert-Mayer, Jensen and Wigner received the 1963 Nobel prize 
for the shell model of nuclear structure. She was only the second female Nobel 
laureate in physics, after Marie Curie. No other female laureate in physics was 
elected until Donna Strickland in 2018. 

An attempt to unify the liquid drop model and the shell model was made in 
the 1950s by Aage Bohr, the son of Niels, Ben Mottelson and James Rainwater, 
who created the so-called collective model [341]. It also takes into account 
the interaction between nuclear protons and neutrons, which the shell model 
treats separately. It combines collective and individual motion of nucleons 
and explains certain magnetic and electric properties which the shell model 
cannot account for. The three received the 1975 Nobel prize in physics for 
their work [426]. 

An ab-initio calculation of the nuclear potential had to wait for the field 
theory of Quantum Chromodynamics, QCD (see Section 9.2). However, appli- 
cations of nuclear physics by far preceded a complete understanding of strong 
forces. The enormous energy stored in nuclei prompted impressive efforts to 
liberate and use it for peaceful as well as military purposes. We will elaborate 
on the latter in Chapter 6. 


5.6 QUANTA INTERPRETED? 


I am certainly not equipped to give you an overview of the many interpreta- 
tions of quantum mechanics which have appeared since its very beginnings. 
Nevertheless, I believe that I cannot avoid at least discussing my very naive 
view of what “really” happens in a quantum system, so that you can disagree 
with me. 

In my view, there are at least two aspects of quantum mechanics 
which are, or at least ought to be, the focus of its interpretations and of the 
discussion of its difficulties. The first one deals with the question of what the 
wave function “really” means. I have called it above an amplitude of probabil- 
ity. This probabilistic denomination has the same meaning as the amplitude in 
“real” waves like the electromagnetic one: it describes a quantity which evolves 
in space-time, as described by the quantum’s equations of motion. Its square 
gives you, for each point in space-time, the probability for the quantum object 
to be there, just like the square of the electromagnetic wave amplitude gives 
you its intensity at every point in space-time. The difference is, of course, that 
the amplitudes of an electromagnetic wave are observable electric and mag- 
netic fields, while the probability amplitude is only observable via a product, 
with itself or with another wave. 
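
The analogy in this paragraph can be written compactly. The following is a minimal sketch in standard notation; the symbols $\psi$, $\psi_1$, $\psi_2$ and $P$ are introduced here for illustration and are not taken from the text above.

```latex
% Probability density from a single amplitude, analogous to the intensity of a classical wave
P(\vec{x},t) = |\psi(\vec{x},t)|^2 = \psi^*(\vec{x},t)\,\psi(\vec{x},t)

% Superposition of two amplitudes: the cross term is the product "with another wave"
|\psi_1 + \psi_2|^2 = |\psi_1|^2 + |\psi_2|^2 + 2\,\mathrm{Re}\left(\psi_1^*\,\psi_2\right)
```

Only such products are observable; an overall phase of the amplitude drops out of both expressions.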

The nuclear shell model is constructed in 
analogy to its atomic equivalent, but with 
a model potential. For a shell with principal 
quantum number $n$, there are $n$ subshells 
with orbital quantum number $l = 0, 1, 2, \ldots, (n-1)$. 
For each one of them, there are 
$m_l = -l, -l+1, \ldots, 0, 1, \ldots, l$ 
Energy levels are shown on the right [321]. Protons 

The potential can be modelled with a central part [327] and coupling between 
spin S and orbital angular momentum L [329]: 

with the radius parameter R > 1 fm. The spin-orbit coupling removes the 
energy degeneracy and the levels partially overlap. Large gaps in energy 
appear each time a shell is completed. This explains the “magic” numbers 
of protons and neutrons and in particular the “doubly magic” nuclei like $^{4}$He, 
$^{16}$O, $^{40}$Ca, $^{48}$Ca, $^{48}$Ni, $^{56}$Ni, $^{208}$Pb etc., which are especially stable. 
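
How the magic numbers follow from completed shells can be checked by simple counting. The sketch below is illustrative only: it assumes the conventional textbook ordering of single-particle levels with spin-orbit splitting and groups them by hand into major shells separated by large energy gaps. The level labels follow the usual nuclear spectroscopic convention and, like the function names, are assumptions of this example. Each level with total angular momentum $j$ holds $2j+1$ identical nucleons.

```python
from fractions import Fraction

# Single-particle levels grouped into major shells (assumed textbook ordering).
# Each entry is (label, j), where j is the total angular momentum of the level.
MAJOR_SHELLS = [
    [("1s", Fraction(1, 2))],
    [("1p", Fraction(3, 2)), ("1p", Fraction(1, 2))],
    [("1d", Fraction(5, 2)), ("2s", Fraction(1, 2)), ("1d", Fraction(3, 2))],
    [("1f", Fraction(7, 2))],
    [("2p", Fraction(3, 2)), ("1f", Fraction(5, 2)), ("2p", Fraction(1, 2)),
     ("1g", Fraction(9, 2))],
    [("1g", Fraction(7, 2)), ("2d", Fraction(5, 2)), ("2d", Fraction(3, 2)),
     ("3s", Fraction(1, 2)), ("1h", Fraction(11, 2))],
    [("1h", Fraction(9, 2)), ("2f", Fraction(7, 2)), ("2f", Fraction(5, 2)),
     ("3p", Fraction(3, 2)), ("3p", Fraction(1, 2)), ("1i", Fraction(13, 2))],
]


def level_capacity(j):
    """Number of identical nucleons that fit into a level with total angular momentum j."""
    return int(2 * j + 1)


def shell_closures(shells):
    """Cumulative nucleon numbers at which each major shell is complete."""
    closures, total = [], 0
    for shell in shells:
        total += sum(level_capacity(j) for _, j in shell)
        closures.append(total)
    return closures


if __name__ == "__main__":
    print(shell_closures(MAJOR_SHELLS))  # [2, 8, 20, 28, 50, 82, 126]
```

The grouping into shells is put in by hand here; in the model itself it results from the energy gaps produced by the potential and the spin-orbit coupling.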

Modern versions of the nuclear potential 
are based on Quantum Chromodynamics 
(see Section 9.2), implemented on a 
lattice [572]. Potentials calculated this way 
are shown on the left [591]. They have a 
shallow minimum at a distance of about 
1 fm. The binding force at larger distances 
is transmitted by mesons [268]. At shorter 
distances there is a repulsive core which 
prevents nucleons from merging. 


Focus Box 5.12: Shell model of the atomic nucleus 

Naive experimentalists like me may entertain a frequentist view of this 
probabilistic quantity. For a frequentist statistician, the probability of an event 
is the ratio between the number of times it occurs and the number of times 
it could have occurred. Thus, the phrase in the weather forecast, that the 
probability for rain tomorrow is 30%, makes no sense to a frequentist because 
there is only one tomorrow. For a Bayesian statistician, on the contrary, this 
statement makes perfect sense. For a Bayesian, the probability just measures our 
degree of knowledge of what lies ahead. 

For a frequentist, an observation of a quantum can be interpreted as 
the single instance of a repetitive measurement process. A large number of 
measurements under identical conditions would map out the probability of 
the quantum’s presence as a function of space and time. Thus the question 
“where was the particle before I measured its position?” is meaningless. With this 
statement at least, the fathers of the Copenhagen interpretation of quantum 
mechanics, Born, Bohr and Heisenberg, would agree. Heisenberg wrote in his 
essay “The Development of the Interpretation of the Quantum Theory” [330]: 


The Copenhagen interpretation is indeed based upon the exis- 
tence of processes which can be simply described in terms of space 
and time, i.e. in terms of classical concepts, and which thus com- 
pose our “reality” in the proper sense. If we attempt to penetrate 
behind this reality into the details of atomic events, the contours 
of this “objectively real” world dissolve-not in the mist of a new 
and yet unclear idea of reality, but in the transparent clarity of 
mathematics whose laws govern the possible and not the actual. 


So he refuses to go further and discuss in what sense the wave function is 
“real”. In his introduction to Heisenberg’s “Physics and Philosophy” [359], 
Paul Davies sharpens these statements, saying that “the reality is in the obser- 
vations, not in the electron.” 

But a question doesn’t go away by declaring it illegal. Many, including me, 
are not content with abstaining from interpreting the wave function itself. 
Perhaps the most convincing (or least unconvincing) way of attacking the 
question of quantum motion originated from Louis de Broglie [218] and David 
Bohm [319] and is sometimes called the pilot wave interpretation. It 
interprets the wave function as a pilot which describes all possible paths a 
particle can take. Its evolution is governed by the quantum equations of motion, 
i.e. the Schrödinger, Klein-Gordon or Dirac equation, depending on the velocity and spin of the 
particle. The particle takes a definite path which is uncovered in the 
measurement, a notion that would appeal to a Bayesian statistician. The theory is 
thus explicitly non-local for coherent states, but each particle has a definite 
position at any moment, inasmuch as the uncertainty relation allows. Bohm 
writes [319]: 


In contrast to the usual interpretation, this alternative interpreta- 
tion permits us to conceive of each individual system as being in a 
precisely definable state, whose changes with time are determined 
by definite laws, analogous to (but not identical with) the clas- 
sical equations of motion. Quantum-mechanical probabilities are 
regarded (like their counterparts in classical statistical mechanics) 
as only a practical necessity and not as an inherent lack of com- 
plete determination in the properties of matter at the quantum 
level. 


The wave nature of quantum states is of course not limited to motion, 
but also applies to stationary states, like atomic ones. What distinguishes a 
particle on a trajectory from a wave-like particle is interference. As far as this is 
concerned, there is no difference between electromagnetic waves [7] (which we are 
familiar with) and matter waves [219, 224] (which we are less familiar with). 
A double slit experiment can be realised with photons as well as electrons, 
and even leaving half an hour between the passage of subsequent photons or 
particles makes no difference to the appearance of an interference pattern in 
the observed intensity (see Focus Box 5.10). The reason is to be sought in the 
simple solutions to the equation of motion, i.e. plane waves (or their 
superposition). Plane waves with fixed momentum are everywhere⁹; with fixed energy 
they are there for all times. There is thus an inherent non-locality in quantum 
phenomena. Once a system of particles is created, its coherence remains, until 
destroyed by an external intervention. This is exposed by interference experi- 
ments and by apparent paradoxes like the Einstein-Podolsky-Rosen one [263]. 
And it is put to fascinating use for example in quantum cryptography [536]. 
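
That an interference pattern builds up from quanta arriving one at a time can be illustrated with a small simulation. This is a sketch under stated assumptions, not a model of any particular experiment: the geometry values are hypothetical, the single-slit envelope is ignored, and the far-field two-slit probability is taken to be proportional to $\cos^2(\pi d x / \lambda L)$. Each detection is drawn independently from this probability, so the time between subsequent quanta plays no role.

```python
import math
import random

SLIT_SEPARATION = 1.0   # d, arbitrary units (hypothetical value)
WAVELENGTH = 0.1        # lambda, same units (hypothetical value)
SCREEN_DISTANCE = 10.0  # L, same units (hypothetical value)
HALF_WIDTH = 2.0        # the screen covers -HALF_WIDTH <= x <= HALF_WIDTH


def relative_probability(x):
    """Far-field two-slit pattern cos^2(pi d x / (lambda L)), a number between 0 and 1."""
    phase = math.pi * SLIT_SEPARATION * x / (WAVELENGTH * SCREEN_DISTANCE)
    return math.cos(phase) ** 2


def detect_one(rng):
    """Draw one detection position by rejection sampling; every call is independent."""
    while True:
        x = rng.uniform(-HALF_WIDTH, HALF_WIDTH)
        if rng.random() < relative_probability(x):
            return x


def accumulate(n_quanta, n_bins=16, seed=1):
    """Histogram of n_quanta independent detections across the screen."""
    rng = random.Random(seed)
    counts = [0] * n_bins
    bin_width = 2 * HALF_WIDTH / n_bins
    for _ in range(n_quanta):
        index = int((detect_one(rng) + HALF_WIDTH) / bin_width)
        counts[min(index, n_bins - 1)] += 1  # guard against x == HALF_WIDTH
    return counts


if __name__ == "__main__":
    for n in (20, 200, 20000):
        print(n, accumulate(n))
```

With few quanta the histogram looks random; with many, bright and dark fringes emerge, which is the frequentist reading of the probability amplitude described earlier in this section.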

That there is destruction of coherence during the measurement process 
is also compatible with the fact that reactions happen or not, at random, 
but guided by a calculable probability. This in my view is the second question 
which one should focus on. When an interaction occurs, the Copenhagen 
interpretation claims that the wave function irreversibly “collapses” into point-like 
particles. The pilot wave interpretation does not see that happen, since the 
point-like nature of quanta is always preserved. Interactions can happen or 
not, at random and again in a wave-like manner, since interference between 
concurrent processes occurs. When they happen, they destroy coherence; 
otherwise the pilot waves continue undisturbed and the particles follow them. 
This is as close as my naive notion of point-like particles moving like a wave 
gets to the thinking of great men. 

There is an interesting interpretation of the fact that quantum processes 
constantly cause Boolean bifurcations, random yes-no decisions following 
calculable probabilities. In so-called “many-worlds” interpretations, 
this leads to a constant splitting of our universe into mutually exclusive his- 
tories. All of these are thought to be embedded in a “multiverse”, including 


⁹ Note that “everywhere” in the quantum sense is to be taken with respect to the quantum 
scale of $10^{-15}$ m. Likewise, a free particle is a bold abstraction: empty space does not exist, 
at least not for long (see Section 9.3). 

an infinite number of possible evolutions. This may seem a little far-fetched, 
but to my knowledge there is no experimental fact contradicting this view. 

I can only skim the surface of the interpretation problems quantum physics 
faces, only as deeply as my limited insight takes me. One of my favourite 
contemporary physicists, Steven Weinberg, has much more to say about this 
subject, both at the informal level [641, 645] and at the formal one [627]. The 
discussion has been going on since quanta were first interpreted and I cannot 
see an end to it. We will thus come back down to Earth rather violently and 
discuss the role of science and scientists in the two World Wars of the 20th 
century. 

CHAPTER 6 


War time physics 


It’s my hypothesis that the individual is not a pre-given entity which 
is seized on by the exercise of power. The individual, with his identity 
and characteristics, is the product of a relation of power exercised over 
bodies, multiplicities, movements, desires, forces. 


Michel Foucault, Power/Knowledge, 1980 [439] 


THIS CHAPTER discusses how nationalism infected society in the beginning 
of the 20th century, not sparing physicists. How the “great” war, 
World War I, severed many precious links between German and essentially all 
other scientists. How ideology polluted physics and caused an unprecedented 
brain drain in Europe during Nazi reign. I cover the concentrated efforts on 
the nuclear bomb on both sides of the Atlantic during the early 1940s. In 
particular, I look into the complicated relation between Heisenberg and Bohr 
during the German occupation of Denmark. And I briefly discuss how cold 
war affected physics and physicists. 


6.1 A “WAR OF THE MINDS”? 


When reading about science and scientists in World War I, one is often 
confronted with the view that it essentially was a chemists’ war. This is of course due 
to the devastating use of chemical warfare, associated with the name of Fritz 
Haber, who became the founding director of the Kaiser-Wilhelm Institute 
for Physical Chemistry and Electrochemistry (now the Fritz-Haber-Institute 
of the Max Planck Gesellschaft) in 1911. This and the Institute for Chem- 
istry were the first ones of the Kaiser-Wilhelm-Gesellschaft (KWG, now Max 
Planck Gesellschaft, MPG), in recognition of the importance of research for 
the rapidly growing chemical industry in Germany. The idea behind the KWG 
was to bring together academia and industry under the wings of the state, 
for the benefit of all three. Haber was a good choice, since he had already 
established close collaboration with Carl Bosch from the chemical company 
BASF when they together scaled up the catalytic synthesis of nitrates to an 
industrial level. The resulting fertilisers contributed a lot to feeding people, 
while nitrate-based explosives killed many. 

On August 1, 1914, Germany declared war on Russia and the Kaiser 
ordered mobilisation of the troops. The same day, nine British scholars, includ- 
ing J.J. Thomson and William Ramsay, published a plea in The Times arguing 
against a British involvement [531]. Following Germany’s invasion of neutral 
Belgium on August 4, Britain entered what was to become World War I. 
The use of chemical weapons was banned by the Hague Convention of 1907, 
to which Germany was a signatory. Notwithstanding this clear proscription, 
the use of gas shells was a focus of the German Ministry of War. In 1916, 
Haber was named head of its chemistry section and led the teams developing 
chlorine and other lethal gases, notably sulfur mustard (LOST) and 
phosgene, with the industrial chemists Wilhelm Lommel and Wilhelm Steinkopf. 
Haber personally attended the first release of gas grenades during the second 
battle of Ypres in spring of 1915. The medical officer of the French army’s 1st 
African Battalion described what happened [608]. The sky darkened with a 
yellow-green cloud. Asphyxiating vapours caused burning throats, pain in the 
thorax, shortness of breath and coughing up blood. This eyewitness survived, 
but few other soldiers were that lucky; of the 15,000 French soldiers in the 
sector, 5000 were dead and a similar number were taken prisoner. With Haber 
as an advisor, a special troop for gas warfare was formed. He drafted physi- 
cists, chemists and other scientists to this unit, like Otto Hahn, Gustav Hertz 
and James Franck, who himself became a victim of a gas attack in 1917. 

An even deadlier gas was phosgene, developed by French chemists led by 
Victor Grignard (Nobel prize in chemistry of 1912) and first used by France 
in 1915 [515]. Around 37,000 tons of phosgene were manufactured during the 
war, out of a total of 190,000 tons for all chemical weapons, making it second 
only to chlorine. The German and French armies were practically the only 
parties deploying it. Less notorious than LOST, phosgene nevertheless caused 
the vast majority of victims in World War I chemical warfare. 

The second notable intervention of scientists in World War I technology 
was the detection of submarines. The German U-Boot was hard to detect with 
the rudimentary hydrophones the Allied navies possessed. Paul Langevin in 
France, a student of Pierre Curie, had suggested using supersonic sound [540] 
instead of audible signals. Urgency was created when the British ocean liner 
“Lusitania” was sunk by the German Navy in May 1915 [635], killing about 
1200 people, including more than 100 U.S. citizens. In July, the British Admi- 
ralty set up the Board of Invention and Research and the Royal Navy set up its 
own Anti-Submarine Division. In the U.S., a Naval Consulting Board was set 
up in 1915 under the leadership of Thomas Alva Edison, but with little success. 
After the American entry into the war in 1917, British and French experts on 
underwater sound were called to a meeting with their American counterparts, 
followed by a meeting in Paris a year later. However, supersonic echo-ranging, 
sonar, was still at the research stage by the end of the war. A more immediate success 
was achieved with the subterranean detection of telegraphic communications across 
the trenches [180]. 

In the U.S., even before its entry into the war, the National Academy of 
Sciences founded the National Research Council [620] with the aim to coordinate 
fundamental and applied research for defence purposes. It was a decentralised 
body under the chairmanship of the astronomer George E. Hale, who had 
excellent international contacts. Robert Millikan served as its vice-chairman. 
In May 1917, a group of British and French physicists, including Rutherford, 
informed their colleagues of their own research, primarily on submarine 
detection. There, too, success was very limited and did not have a significant 
influence on the outcome of the war. But the war certainly helped to organise 
governmental support and direction of science. 

A very disturbing involvement of scientists in the beginning of World War 
I is their contribution to war propaganda in late 1914, seen from within as 
a “War of the Minds” [164]. Fritz Haber was a war enthusiast of the first 
hour, signing with other scientists, writers and artists the so-called “Man- 
ifesto of the Ninety-Three” in October 1914 [488]. This manifesto had the 
signature of 15 otherwise respectable scientists, including the six Nobel 
laureates Planck, Röntgen, Ostwald, Wien, Fischer and Baeyer. Carried away by 
a surge of patriotism, without having first-hand information to support their 
claims and sometimes without even knowing the exact text, the signatories 
formulated their “protest to the civilized world against the lies and calumnies 
with which our enemies are endeavoring to stain the honor of Germany in her 
hard struggle for existence” [166]. A notable absentee from the list of signa- 
tures is Albert Einstein, who had arrived in Berlin in March 1914 to head the 
Kaiser-Wilhelm-Institute for physics, which was to be created in 1917. 
Einstein had been German, stateless, Austro-Hungarian and Swiss in his life 
and was profoundly uninterested in the notion of nations [168]: 


I am far from keeping my international conviction secret. How 
close a man or a human organisation is to me only depends on 
my judgement of his desires and abilities. The state to which I 
belong as a citizen does not play the faintest role in my feelings; 
I regard the affiliation to a state as a business matter, rather like 
the relation to a life insurance. 


With only three others¹, Einstein signed a “Manifesto to the Europeans” [495] 
in mid-October 1914, which stated that “no war has ever so intensively inter- 
rupted the cultural communalism of cooperative work as this present war 
does.” The clear-sighted authors predicted: 


The struggle raging today will likely produce no victor; it will leave 
probably only the vanquished. Therefore it seems not only good, 
but rather bitterly necessary, that educated men of all nations 
marshall their influence such that -whatever the still uncertain end 
of war may be- the terms of peace shall not become the wellspring 
of future wars. 


¹ It was drafted by the physiologist Georg Friedrich Nikolai, and signed by Einstein, the 
astronomer Wilhelm Julius Förster and the philosopher Otto Buek. 

In the middle of nationalist fervour, this remained the standpoint of a small 
minority. 

The “Manifesto of the Ninety-Three” backfired completely, causing out- 
raged reactions outside Germany. British scholars reacted with a counter- 
manifesto, signed by 117 scholars including Bragg, Rayleigh and Thomson, 
published as a “Reply to the German Professors” in The Times [161]. It con- 
cluded: 


We grieve profoundly that, under the baleful influence of a mili- 
tary system and its lawless dreams of conquest, [Germany] whom 
we once honoured now stands revealed as the common enemy of 
Europe and of all peoples which respect the law of nations. 


Both sides claimed that war had been forced on them, reflecting the view 
of their respective governments. The Würzburg professor Wilhelm Wien (see 
Section 5.1) went a step further and solicited colleagues to always balance 
the number of citations of British authors with an equal number of German 
papers. Stefan L. Wolff has meticulously analysed the history of Wien’s ridicu- 
lous initiative [531] using the abundant correspondence of physicists during 
that time. He found that, in addition to opposing the perceived “Engländerei” 
of German researchers, it was motivated by an aversion to the 
domination of physics research by the liberal Berlin scientists. Soon, the initiative was 
taken over by more radical folkish-nationalist physicists like Philipp Lenard 
and Johannes Stark, trying to construct an antinomy between German and 
foreign physics. Opposition to the initiative was minor but notable. Friedrich 
Paschen, known for his contributions to atomic spectroscopy, and Emil 
Warburg, the president of the Physikalisch-Technische Reichsanstalt, rejected it 
vigorously. Max Planck, a rather moderate nationalist, argued that science 
and politics had to be kept strictly separate. Arnold Sommerfeld must have 
felt the antiscientific nature of the initiative, since he recommended keeping 
it secret. In any case, this attack against the global nature of science marks 
the infection of part of the German science community by views alien to sci- 
entific thinking. It culminated in the Third Reich in a movement in favour of 
a would-be “German physics”. 

World War I left about 8.5 million dead and more than 20 million 
wounded². Of these, about 90,000 were killed by chemical weapons and many 
more were left impaired. In German public opinion, the Nobel prizes awarded 
to German scientists shortly after the end of World War I, the 1918 prize in physics 
(awarded in 1919) for Max Planck and the 1919 double prize for Johannes 
Stark in physics and Fritz Haber in chemistry, were taken as a “German 
victory” [489]. But at the end of the war, German scientists were excluded from 
international conferences for ten years [611]. German, formerly a language of 
science, was completely replaced by English. 


² See https://www.britannica.com/event/World-War-I/Killed-wounded-and-missing 

6.2 “GERMAN” PHYSICS 


Bigotry, nationalism, racism and other non-scientific intrusions into science 
of course did not suddenly appear in 1914 out of nothing. Scientists do not 
live in a political vacuum; they are citizens and often engage in the political 
and social questions of their time. A good example may be Max Planck [519], 
respectful towards hierarchies, a supporter of monarchy during World War I 
and signatory of the “Manifesto of the Ninety-Three”. He however maintained 
a respectable conservative attitude, refused any boycott of foreign institutions 
and lobbied for the access of women to higher education. 

In the period between the two world wars, Planck actively participated in 
the reconstruction of intellectual life in Germany. Indeed, the First World War 
left universities bankrupt. In 1920, Planck helped create the “Emergency 
Association of German Science”, together with Fritz Haber and the former 
Prussian Minister of Culture Friedrich Schmidt-Ott. Its objective was to provide 
the academic community with a central institution of fund raising, both from 
governmental sources and from industry. This organisation continued to exist 
through the Third Reich as “Deutsche Forschungsgemeinschaft” (DFG) and 
was re-established in 1945. Planck intervened in favour of fundamental research 
and opposed the folkish-nationalist and racist views of Lenard and Stark. In the 
1920s Stark published a book on “The present crisis in German Physics” [190] 
where he constructs an antinomy between an overrated theoretical physics, to 
which he attests a lack of intuitiveness, and an underrated experimental physics. 
Max von Laue dismantled his arguments in a review in 1923 [193], where he 
argued a difference in essence between the two branches of science, instead of a 
difference in value. He also rejected that it concerns only German physics since 
relativity and quantum physics shook up established convictions in all countries. 
He ended with the devastating conclusion: “All in all, we would have wished 
that this book had remained unwritten.” Stark left academia in 1920, Lenard 
resigned from the German Physical Society in 1925. 

Lenard and Stark had an early association to the Nazi party. While Adolf 
Hitler was locked up in the fortress of Landsberg and wrote his program- 
matic book “Mein Kampf”, they published an essay entitled “Hitlergeist und 
Wissenschaft” [197]. In this pamphlet, they declare an emergency³: 


Do not be deceived, the Aryan-Germanic blood, the carrier of its 
unique spirit, is already in the process of disappearing rapidly; a 
racially alien spirit has been busily at work for over 2,000 years. 
The exact same force is at work, always with the same Asian people 
behind it that had brought Christ to the cross, Jordanus Brunus to 
the stake, and that shoots at Hitler and Ludendorff with machine 
guns and confines them within fortress walls: It is the fight of the 
dark spirits against the torchbearers in an endeavor to eliminate 
their influence on earth. 


³ The English translations of documents in this section come from the anthology of Klaus 
and Ann M. Hentschel [487]. 

In this pathetic rhetoric, they pledge adherence to the national-socialist 
movement, “with Hitler beating the drum.” In the same spirit, Lenard in his four- 
volume textbook “German Physics” [271] claims that relativity and quantum 
theory were Jewish inventions, to be replaced by superior Aryan-Germanic 
theory. In reality he revives the older conflict between theoretical and exper- 
imental physicists. In the introduction he writes: “The meaning of all well- 
tested natural laws is simple; they can thus be formulated not only using 
equations, but also using normal words. All undisputed scientific knowledge 
can be presented without a significant amount of mathematics.” The textbook 
was discussed in a series of articles in the press in 1936. The student Willy 
Menzel welcomed it in the Nazi propaganda newspaper “Völkischer Beobachter”; 
Heisenberg called it “erroneous and misleading” in a subsequent article. Stark 
in response again dug up his old claim of the priority of experiment versus 
theory, especially “Jewish” theory. He made the outrageous claim that theory 
had nothing to do with the discovery of the positron, X-rays and quanta, and 
concluded: “Theory must be brought back within bounds from its 
presumptuous position.” Anti-modernist and antisemitic prejudices were thus united by 
physicists disconnected from progress. In 1933, when Hitler came to power, 
Stark was made president of the PTR against the advice of experts. He also 
became president of the DFG in 1934. 

The mounting antisemitism began to target several great scientists, 
including Albert Einstein. In 1933, when Hitler became chancellor of the 
German Reich, Einstein decided not to return from a visit to the U.S. In May 
1933, Planck met Hitler and tried to intervene in favour of his Jewish col- 
leagues. To no avail. The alignment of German science with national-socialist 
power proceeded. Planck finally gave up all official functions in 1938. The Nazi 
government and party tried to steer academic appointments, but with limited 
success. Also the movement for “Aryan physics” of Lenard and Stark only 
had a handful of second-rate adherents, mostly among their own students and 
collaborators. 

The persecution of Jews in Germany, however, went into full swing when, on 
November 9, 1938, in a first organised pogrom, SA paramilitary forces and 
civilians ransacked Jewish homes, hospitals, businesses and schools. More than 
250 synagogues were destroyed all over Germany. Tens of thousands of Jewish 
men were imprisoned in concentration camps. The events were widely reported 
around the world. The correspondent of The Telegraph wrote on November 
11 [160]: 


An officially countenanced pogrom of unparalleled brutality and 
ferocity swept Germany today. Beginning in the early hours of this 
morning and continuing far into tonight, it puts the final seal to 
the outlawry of German Jewry. Mob law ruled in Berlin through- 
out this afternoon and evening and hordes of hooligans indulged in 
an orgy of destruction. I have seen several anti-Jewish outbreaks 
in Germany during the last five years, but never anything as nau- 
seating as this. 
The exodus of German Jews had started as soon as Hitler came to power.
It now accelerated⁴. By 1935, when the so-called Nuremberg racial laws
came into force, about 65,000 Jewish citizens had left, including some 1600
scholars, according to a list established by the “Notgemeinschaft deutscher 
Wissenschaftler im Ausland”. The Nuremberg laws formalised the policy to 
remove Jews, socialists, communists and other disagreeable activists from civil 
service. By 1936, this had led to the dismissal of more than 1600 university 
teachers, about 20% of the total teaching staff, including 124 physicists, again 
according to the “Notgemeinschaft”. Until 1942, about 330,000 more Jews 
were expelled, leaving all of their belongings behind. After the take-over of 
Austria in 1938, 150,000 Austrian Jews joined the forced expatriates. 

The racial cleansing of universities mostly affected the physics faculties in
Berlin and Göttingen due to their size and liberal orientation. According to
Hentschel [487], it basically cut the scientific output in Germany by half as 
determined by bibliometry. Moreover, he finds that “the number of papers 
by future emigrés is positively correlated with the novelty of the topics and 
negatively with the conservativeness of a journal.” The loss is thus larger 
than the pure numbers suggest, since the more innovative physicists left.
Technology suffered as well. Between 1930 and 1940, the number of patents almost
halved. Schrödinger, Franck and Haber left their positions in protest against
Nazi policy; Sommerfeld went to Oxford in 1939.

Preferred host countries of the expatriates were Britain with about 10% 
and especially the U.S. with almost 50%. Relief organisations were created 
in many countries immediately after mass emigration started. In addition, 
influential individuals like Rudolf Ladenburg, Wigner, Einstein and Weyl did 
their very best to help displaced colleagues. Although many émigrés found it
difficult to integrate, their host countries clearly profited
from the influx of brain power, both quantitatively and qualitatively. The U.S.
in particular, with its rapidly expanding R&D sector for nuclear physics,
presented many opportunities to newly arriving physicists. Notwithstanding 
strict security screening, many of them joined the weapons program after the 
U.S. entered into the war in 1941 (see Section 6.4). 

In the period between 1939 and 1941, the policy of the Nazi government 
escalated from enforced emigration to mass murder [569]. In January 1942 at 
the Wannsee Conference held near Berlin, Nazi officials formulated the plan for 
a “final solution to the Jewish question”, which culminated in the Holocaust, 
killing two thirds of the Jewish population in Europe. Bruno Tesch, a former 
student of Fritz Haber, was instrumental in the extermination at industrial 
scale. The Hamburg company Tesch & Stabenow, which he co-founded, delivered
the granular prussic acid (HCN) cartridges under the brand name Zyklon
B to the gas chambers. By 1945, only about 25,000 Jews in Germany survived,
out of an initial population of more than 560,000. Tesch and his deputy Weinbacher
were sentenced to death by a British military court and executed in
1946. Prominent leaders of Nazi Germany who planned and carried out the
Holocaust and other war crimes were tried in a series of military tribunals
in Nuremberg shortly after the war. The prosecution of concentration camp
personnel continues to this day.


⁴ Hentschel [487] has compiled the available statistics on emigration in general and on
that of scholars in particular. The numbers quoted here are based on his findings.


6.3 HITLER’S NUCLEAR BOMB?


Directly after the discovery of nuclear fission by Hahn and Strassmann and 
its explanation by Meitner and Frisch (see Focus Box 6.1), a group of scien- 
tists formed in Germany, officially called “Working group for nuclear physics” 
and informally known as “Uranverein”, to look into technical applications 
of the enormous energy stored in nuclear matter. The Ministry of Educa- 
tion with its Reichs Research Council (Reichsforschungsrat, RFR) as well as 
the Ministry of War were notified about the potential technical and military 
application of nuclear power. In parallel, Siegfried Flügge, an assistant to Otto
Hahn, speculated about applications in a published paper [283], taken up by
the press [485]. The early Uranverein had members from Göttingen (Georg
Joos, successor of Franck, and Wilhelm Hanle), Heidelberg (Bothe and Wolfgang
Gentner), Leipzig (Gerhard Hoffmann, successor of Debye, and Robert
Döpel), as well as Berlin (Geiger). It was a rather loose working group; there
was no effort by political authorities to centralise this research. The work was
interrupted after a few months since some of its members were drafted for mil- 
itary service. Meanwhile, Bohr and Wheeler worked out the theory of nuclear 
fission using the liquid drop model [282]. 

A new effort of collaboration was started after the beginning of hostilities 
in September of 1939. This second Uranverein now included Flügge, Heisen- 
berg from Leipzig and Carl Friedrich von Weizsäcker (see Focus Box 5.11) 
from the KWI for Physics in Berlin, Walther Gerlach from Munich and the 
physical chemist Paul Harteck from Hamburg. Kurt Diebner, advisor of the 
Army Ordnance Office (Heereswaffenamt, HWA), represented the military. 
At about the same time, the KWI for Physics in Berlin was taken out of
the Kaiser-Wilhelm-Gesellschaft and subordinated to the military, with Diebner
as its administrative director, replacing the Dutchman Peter Debye, who went
to Cornell. The project was especially supported by Erich Schumann, who also taught
courses on acoustics and explosives, his areas of research, at Berlin University.
Schumann was the doctoral advisor to Wernher von Braun, who had obtained 
his doctorate in 1934. Another strong political supporter was Abraham Esau, 
president of the Reichs Research Council. 

According to a 1939 memorandum by Heisenberg [287] and his post-war 
account [307], it was clear to the Uranverein members from the beginning 
that two applications of uranium fission were possible: a nuclear reactor using 
slow neutrons and moderately enriched uranium; or a nuclear explosive using 
uranium almost pure in the isotope ²³⁵U and fast neutrons. In fact, fission
by slow neutrons is not usable for explosives. With all other parameters the 
same, the energy yield in a fission by thermal neutrons is a factor 10° less than 
by fast neutrons. The physics of nuclear fission is explained in Focus Box 6.1. 
The entrance of neutrons into a nucleus is not hindered by the Coulomb
barrier; they can become absorbed, (A, Z) + n → (A+1, Z)*. The final
state nucleus is in an excited state and can undergo fission into two more
tightly bound lighter nuclei. An example is the induced fission of uranium:
²³⁵U + n → ¹³⁷Ba* + ⁹⁷Kr* + 2n. The nuclei in the final state are typical
examples; many others are possible. The excited nuclei of the final state can
cool off by emitting further neutrons; the average total number is about 2.2.


The binding energy is roughly 7.5 MeV per nucleon for ²³⁵U and 8.4 MeV for
the two daughter nuclei (see Figure 5.7). Fission thus liberates the enormous
energy of almost 1 MeV per nucleon, or more than 200 MeV per uranium nucleus.
So why doesn’t it occur spontaneously for most nuclei? The answer is the same
as for e.g. α decay: there is an energetic barrier to be crossed which is
schematically shown below. When a nucleus is excited, e.g. by an incoming
neutron, its shape is deformed from the spherical ground state as shown on
the left. In the simplest case, there is an elongation along a single axis, by
a distance ε. Three cases can occur [567, p. 320]:

[Figure: a spherical nucleus deforming into a spheroidal shape with elongation ε]


[Figure: potential energy as a function of the deformation ε, with dotted,
solid and dashed curves for cases (a), (b) and (c)]

(a) For light nuclei, deformation increases the potential energy steeply, as
shown by the dotted curve. The nucleus will normally return to its spherical
ground state by getting rid of the excess energy, emitting e.g. neutrons or
photons. (b) For heavier nuclei, the barrier is shallower as shown by the solid
curve. (c) For nuclei above Z²/A ≈ 50, the barrier disappears (dashed curve)
and fission occurs spontaneously. The height of the barrier, E_a, is called the
activation energy.

It mainly depends on the interplay between surface energy (which increases
with ε) and Coulomb energy (which decreases with ε). The simplest deformation
is into an ellipsoid with elongation ε, changing the surface and Coulomb
terms of the liquid drop model (see Focus Box 5.11) like this:

$$ a_S A^{2/3} \to a_S A^{2/3} \left(1 + \tfrac{2}{5}\varepsilon^2\right) ; \qquad a_C \frac{Z^2}{A^{1/3}} \to a_C \frac{Z^2}{A^{1/3}} \left(1 - \tfrac{1}{5}\varepsilon^2\right) $$


However, the quantum mechanical terms also play a role. The isotope ²³⁵U is
an even-odd nucleus, ²³⁸U a more tightly bound even-even one. This explains
why fission of the latter is only induced by energetic neutrons with an energy 
above 6 MeV, while the lighter isotope is fissioned by neutrons of any energy. 


Focus Box 6.1: Nuclear fission 
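
The statement in Focus Box 6.1 that the barrier disappears above Z²/A ≈ 50
can be made plausible with a short estimate. The sketch below assumes the
standard small-deformation expansion of the liquid drop surface and Coulomb
terms, and typical textbook coefficients a_S ≈ 17 MeV and a_C ≈ 0.7 MeV. These
numerical values are assumptions for illustration and are not taken from this
chapter.

```latex
% Energy change for a small elongation \varepsilon (liquid drop model):
% the surface term grows, the Coulomb term shrinks.
\Delta E(\varepsilon) \simeq \varepsilon^2
  \left( \frac{2}{5}\, a_S A^{2/3} - \frac{1}{5}\, a_C \frac{Z^2}{A^{1/3}} \right)

% The barrier vanishes when \Delta E \le 0, i.e. when
\frac{Z^2}{A} \ge \frac{2\, a_S}{a_C}
  \approx \frac{2 \times 17\ \mathrm{MeV}}{0.7\ \mathrm{MeV}} \approx 49
```

For ²³⁵U one finds Z²/A = 92²/235 ≈ 36, which lies below this threshold. Within
this rough estimate a barrier therefore remains, in line with the need for an
incoming neutron to induce fission.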
The daughter nuclei of the fission process are in an excited state and fall
back to the ground state by emitting neutrons and photons. If there are enough
neutrons and if they have the right energy range, a chain reaction can be
caused as sketched on the right. ²³⁵U produces on average 2.2 neutrons per
fission. The ratio k = n_{i+1}/n_i, the effective number of neutrons n produced
in stage i+1 divided by the number of neutrons from the previous stage i,
can be used to characterise the process.

[Figure: sketch of a chain reaction, in which the neutrons from each fission
induce further fissions]


The effective number of neutrons only counts those which are not absorbed by 
other materials or escape, and which have the right energy range. For k < 1, 
the process is called sub-critical and the reaction will die out exponentially. 
Critical conditions, k = 1, correspond to a steady state with a constant number 
of neutrons and a constant energy production: this is the condition required 
for a nuclear reactor. In the over-critical state, k > 1, the number of neutrons 
and fission reactions grows exponentially with time. This is the necessary 
condition for a nuclear explosion. 


Focus Box 6.2: Nuclear chain reaction 
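
The three regimes described in Focus Box 6.2 follow directly from the
recursion n_{i+1} = k n_i. The following minimal sketch iterates this relation
for an idealised chain in which k is the same at every stage. The function
names and the chosen values of k are illustrative assumptions, not part of the
original text.

```python
def neutron_population(k, generations, n0=1.0):
    """Return [n_0, n_1, ..., n_generations] for n_{i+1} = k * n_i."""
    counts = [n0]
    for _ in range(generations):
        counts.append(counts[-1] * k)
    return counts


def regime(k):
    """Classify the chain reaction by its multiplication factor k."""
    if k < 1:
        return "sub-critical"
    if k == 1:
        return "critical"
    return "over-critical"


if __name__ == "__main__":
    # Illustrative values of k only; real assemblies are not this simple.
    for k in (0.9, 1.0, 1.1):
        final = neutron_population(k, generations=100)[-1]
        print(f"k = {k}: {regime(k)}, n_100 / n_0 = {final:.3g}")
```

Even a multiplication factor only slightly different from one changes the
neutron number by orders of magnitude within a hundred generations, which is
why a reactor has to be held very close to k = 1.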


The rare component of natural uranium, ²³⁵U, can be fissioned with neutrons
of any energy and emits further neutrons, thus potentially establishing a
chain reaction. The majority component of natural uranium, ²³⁸U, has a high
absorption cross section for fast neutrons and thus hinders the chain reaction. 
We explain the basic physics of nuclear chain reactions in Focus Box 6.2. A 
chain reaction in only moderately enriched uranium requires a moderator, a 
substance which thermalises neutrons without absorbing them. Usage as an 
explosive requires technology to separate the two isotopes of uranium. 

These two requirements defined the strategy for pursuing applied nuclear 
physics by the Uranverein. Concerning the reactor line of research, a search 
for an efficient moderator concentrated on two substances: heavy water (D₂O)
and pure carbon in the form of graphite. The absorption cross sections for the 
two materials were measured in 1940, by Döpel and Heisenberg in Leipzig for 
heavy water, and by Bothe and collaborators in Heidelberg for graphite. The 
pure graphite used for absorption measurements by the latter had residual 
impurities of boron and cadmium, resulting in much too high absorption [293, 
302]. It was thus decided to concentrate the efforts on moderation with heavy 
water. 

Heavy water is a natural component of water, though only at a level of 
0.01%. The main supplier at the time was a company in Norway close to 
Rjukan, which was running a hydroelectric power plant delivering 108 MW. 
Part of the power was used for nitrogen separation and fertiliser produc- 
tion. Another part was used for water electrolysis. In the residual water after 
electrolysis, the proportion of heavy water is enriched. After the German
invasion of Norway in April 1940, the plant was requisitioned and improved by
Harteck. Its peak production rate was about 200 litres of D₂O per month.
The plant was sabotaged by partisans and bombed by allied forces several
times; in 1943 it had to cease production. The required uranium was
ordered from the Auer company in Oranienburg, run by the SS. It used natural
uranium coming from mines in Jáchymov, close to Karlsbad in the occupied
Sudetenland. However, purification experiments by Harteck and others with
gaseous UF₆ failed.

In 1940, Weizsäcker [292] pointed out that in a nuclear reactor the
isotope ²³⁹U would also be produced. In its decay chain, elements 93 and 94,
neptunium and plutonium, would appear, with fission properties similar to ²³⁵U.
There was no experimental proof, since this would have required a working
reactor, but the conclusion was considered likely [307]. Indeed plutonium,
²³⁹Pu, can be used to construct a nuclear bomb. This possibility of breeding
nuclear fuel was used in the Manhattan Project. 

In the summer of 1941, about 150 litres of heavy water were available and
an experimental set-up, named L-IV, was built in Leipzig to measure the 
multiplication rate for neutrons in layers of uranium, separated by heavy water 
in a spherical geometry. Using metallic uranium, Robert and Clara Döpel 
together with Heisenberg measured a multiplication rate greater than one [296] 
in 1942, indicating that a scaled-up version of the reactor would sustain a chain 
reaction and produce energy. The L-IV exploded in June 1942, when air leaked
in during an inspection, creating the world’s first nuclear accident. 

The results were reviewed in 1942, in meetings with the minister of Science 
and Education, Bernhard Rust, and the minister of Armaments and War 
Production, Albert Speer. It was decided to continue research at the existing
low level. The declared goal was to construct a nuclear reactor capable of
driving machines, such as those in U-boats or other marine vessels. The KWI for
Physics in Berlin was returned to the Kaiser-Wilhelm-Gesellschaft and Heisenberg
appointed as director. Under difficult war-time conditions, the research led to
the construction of an experimental reactor in Berlin-Dahlem and a subsequent
model in an underground facility in Haigerloch close to the Black Forest. 
In early 1945, the latter came close to a critical multiplication of neutrons 
without actually reaching it. On April 22, 1945, Haigerloch was occupied by 
U.S. troops. The reactor was confiscated, dismantled and later destroyed. 

The sequence of events briefly sketched above is basically undisputed by 
historians and contemporary witnesses. What is the subject of sometimes 
passionate debate is the motivation of German physicists in the Uranverein 
and the reason for their failure to pursue the construction of a nuclear bomb. 
Heisenberg and Weizsäcker emphasise their reluctance to work on nuclear
weapons, citing a lack of resources and time. They argue that despite the
already difficult military situation, the military in 1942 still believed in a
soon-to-come Final Victory (“Endsieg”). For themselves, they state their belief
that the war would inevitably have ended before a nuclear bomb could have
been available, and that non-military applications would be more important
after the war ended.

The Dutch-American Samuel Goudsmit, who had proposed electron spin
together with George Uhlenbeck (see Section 5.3), developed a very different point
of view. He was the chief scientific advisor of the Alsos mission, an intelligence
operation within the Manhattan Project, created after the allied invasion
of Italy in 1943. Its aim was to collect intelligence related to German nuclear
research, but also to chemical and biological weapons. Its assignment was
to seize personnel, documents and material in order to evaluate the state of
German research and prevent it from falling into the hands of competing powers.
Alsos personnel closely followed the advancing front line and sometimes even
crossed into enemy territory. They reached the remnants of the Uranverein in
Hechingen, where
the KWI for Physics had relocated. Among others, Alsos took Heisenberg, 
Hahn, von Laue and Weizsäcker into custody and interned them in the Farm 
Hall estate in England, where their conversations were surreptitiously taped. 
In many articles published in 1946 and 1947, as well as his book “Alsos” [458], 
Goudsmit concludes that the Germans did not get anywhere close to creating 
a weapon. He basically claims that his German colleagues had not understood 
how a nuclear bomb would function at a basic level, let alone at an engineer- 
ing level. Heisenberg took this as an attack on his understanding of nuclear 
physics, and tried to counter it with an article in Nature [308]. An exchange 
of letters followed, where each one stood his ground, until both men agreed 
that it would be better to only discuss physics. 

While incompetence and unwillingness of the protagonists and the Nazi 
authorities are claimed to be the main drivers for the Uranverein’s failure, it 
is clear that Germany did not have and did not want to create the necessary 
resources for a serious attempt at nuclear weapons. The funds made available
for the Uranverein’s activities, estimated to be about 2 million dollars, were
at the per mille level of the U.S. effort in the Manhattan Project. At the
height of Germany’s
activities in 1942, no more than about 70 scientists worked on nuclear applica- 
tions. The only German development effort during World War II which came 
anywhere near the Manhattan Project was the rocket program led by Wernher 
von Braun in Peenemünde. In 1942, it employed an estimated 2000 scientists 
and twice as many workers and cost at least half a billion dollars.

The historical discussion on the motivations of the German nuclear scien- 
tists often converges on a single person and a single incident, Heisenberg’s 1941 
visit to Niels Bohr in Copenhagen, occupied by German troops since April 
1940. The visit is to be seen in the context of the propaganda lecture tours 
of Heisenberg, von Weizsäcker and others to occupied countries [470]. Heisen- 
berg used such an occasion to see Bohr in September 1941. On the advice 
of Weizsäcker, he brought up German nuclear research and the possibility of 
a bomb based on the principle of nuclear fission. Bohr was deeply shocked 
and angry, and stopped the meeting abruptly. Heisenberg and von Weizsäcker 
claimed after the war that Heisenberg was merely seeking Bohr’s advice on 
what to do, insinuating that he was trying to withhold crucial information 
from Nazi authorities [402]. But Bohr took it as a threat by a representa-
tive of an enemy country, trying to squeeze information about allied activities 
out of him and even ask for his collaboration. Heisenberg’s point of view was 
reflected in Robert Jungk’s very popular book “Brighter than a Thousand 
Suns” [333] published after the war, based on interviews with von Weizsäcker 
and Heisenberg himself. When Bohr learned about it in 1957, he drafted a let- 
ter to Heisenberg in response, but never sent it [532]. In the draft, he described 
what he recalled from the meeting, his anger and disappointment. 

Since there are no witnesses to this encounter, which destroyed the old 
friendship and close working relationship between the two men, it has inspired 
artistic interpretations. Poetic accounts are to be discovered in a short story by 
Per Olov Enquist [494] and a novel by Jérôme Ferrari [619]. The most remarkable
re-enactment for me is Michael Frayn’s award-winning play “Copenhagen”
[518], which first ran in London in 1998 and on Broadway in 2000. It
triggered a lively discussion among scholars, e.g. in the New York Review of 
Books [523, 524, 528, 533, 534, 538, 539, 547, 637]. In 2002, it also prompted 
Niels Bohr’s family to release the unsent letters to Heisenberg which Bohr had 
drafted between 1957 and 1962 [532]. 

There are many books describing the German effort for a nuclear reactor 
and nuclear explosives during World War II; some I have consulted are listed 
in the Further Reading section of this chapter. If you wonder which books 
to read, I recommend that you consult the very fair comparative review of 
Klaas Landsman [535]. Some of the authors subscribe to Heisenberg’s and 
von Weizsäcker’s post-war claim that they, unwilling to collude with the Nazi
authorities, deliberately held back crucial information such that there was no
serious bomb project. Their line of argument is that they actively supported
the reactor project, with mainly post-war applications in mind. In an inter- 
view, Edward Teller (see Section 6.4) went so far as to say that Heisenberg 
sabotaged the bomb effort [661]. 

Goudsmit’s point of view of German incompetence, supported e.g. by Paul
L. Rose [507], is often criticised on the basis of the available documentation
and witness accounts. Heisenberg and his Uranverein wrote at least two reports
to their authorities, which
show that they clearly understood the difference between a controlled chain 
reaction in a reactor and an uncontrolled one in a bomb [662], differences in 
the required fuel and moderator materials as well as the process itself. Von 
Weizsäcker even tried to get a patent on plutonium breeding from a nuclear 
reactor and its usage for explosives [662]. An argument in favour of incom- 
petence seems to be the reaction of German internees in Farm Hall [525], 
when they heard about the nuclear bombing of Hiroshima and Nagasaki. It 
appears that Heisenberg had never seriously calculated the critical mass (see 
Focus Box 6.3) of pure ²³⁵U or plutonium which it would take to construct a
nuclear bomb. Only at Farm Hall did he get a correct estimate, even though 
all necessary information had long been available to him [626]. 

Goudsmit generalised his findings [458]: “I think the facts demonstrate 
pretty conclusively, that science under fascism was not, and in all probability 
Producing more neutrons in each step of a fission reaction is not sufficient to
establish a chain reaction. As the neutrons diffuse through the material, they
must have a sufficient chance to cause another fission before being absorbed
or lost through the outer boundary. If absorption is negligible, the number of
produced neutrons is proportional to the volume, the number of lost ones to
the surface. There is thus a critical size, where both are equal. For larger
sizes, the number of fissions grows exponentially; for smaller ones it dies out. For
a known shape, this size corresponds to a mass of fissionable material, the 
critical mass. 

In pure ²³⁵U, neutron absorption is negligible and neutrons of about 1 MeV
kinetic energy are fast enough to cause a chain reaction in a material of
sufficient size, before the assembly blows up thermally. In a large amount of
active material, the number of neutrons per unit volume, $N(\vec{x}, t)$,
follows a diffusion equation, with diffusion coefficient D:

$$ \frac{\partial N}{\partial t} - D \nabla^2 N = \frac{\nu - 1}{\tau} N $$
This is a continuity equation for N and its current density $\vec{j} = -D \nabla N$,
with a source term on the right describing the number of neutrons produced per
unit time, using the average number ν of neutrons produced per fission and the
mean time τ between fission reactions. A stationary solution, corresponding
to reactor conditions, is defined by $\partial N/\partial t = 0$. For an
exponential growth in fission reactions, we must have
$N(\vec{x}, t) = N_1(\vec{x})\, e^{\nu' t/\tau}$, where ν′ is the effective
number of neutrons amenable to fission reactions and τ is the average time
between two of those. We assume a spherically symmetric active region, such
that $N_1(\vec{x}) = N_1(r)$. In addition, the neutron density and current must be
continuous at the outer boundary. We then find for the spatial part of the
neutron distribution:

$$ \nabla^2 N_1(r) + \frac{\nu - 1 - \nu'}{D\tau}\, N_1(r) = 0 \quad (r < R) ; \qquad \nabla^2 N_1(r) = 0 \quad (r > R) $$

The solution is $N_1 = \sin(\pi r/R)/r$, provided that
$\nu' = (\nu - 1) - \pi^2 D\tau/R^2$. For an exponential growth, ν′ must be
greater than zero. There is thus a critical radius $R_c = \pi l$ with the
so-called diffusion length $l = \sqrt{D\tau/(\nu - 1)}$. Using
numerical values available at the time for the required constants [299, 626],
one gets a critical radius of about 13 cm and a critical mass of almost 200 kg,
an overestimate by roughly a factor of 3. This is due to the fact that the
diffusion equation, to be valid, requires a size much larger than the diffusion
length, which is not the case. The full calculation is most easily done by numerical
simulation. Enrico Fermi, Stanislaw Ulam, Nicholas Metropolis and John von 
Neumann [310, 461, 464] developed the so-called Monte-Carlo method for this 
purpose. It is used in particle physics today to simulate the passage of particles 
through active and passive materials. 


Focus Box 6.3: Critical mass 
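
The estimate in Focus Box 6.3 can be retraced numerically. The sketch below
evaluates R_c = π √(Dτ/(ν − 1)) under two additional assumptions that are not
stated in the box: the kinetic-theory relations D = λ_t v/3 and τ = λ_f/v,
with λ_t and λ_f the transport and fission mean free paths, so that the neutron
speed v cancels in the product Dτ. The cross sections, the density and the
function names are illustrative assumptions of the kind available in the
1940s, not the constants of references [299, 626].

```python
import math

AVOGADRO = 6.022e23   # atoms per mole
BARN_CM2 = 1.0e-24    # one barn in cm^2


def critical_radius_cm(nu, sigma_fission_barn, sigma_transport_barn,
                       density_g_cm3, molar_mass_g=235.0):
    """Diffusion-theory critical radius R_c = pi * sqrt(D*tau / (nu - 1))."""
    # Number density of nuclei per cm^3
    n = density_g_cm3 / molar_mass_g * AVOGADRO
    # Mean free paths for fission and for transport (cm)
    lambda_f = 1.0 / (n * sigma_fission_barn * BARN_CM2)
    lambda_t = 1.0 / (n * sigma_transport_barn * BARN_CM2)
    # Assumed relations: D = lambda_t * v / 3 and tau = lambda_f / v,
    # hence D * tau = lambda_t * lambda_f / 3, independent of v.
    d_tau = lambda_t * lambda_f / 3.0
    diffusion_length = math.sqrt(d_tau / (nu - 1.0))
    return math.pi * diffusion_length


def sphere_mass_kg(radius_cm, density_g_cm3):
    """Mass of a homogeneous sphere of the given radius."""
    volume_cm3 = 4.0 / 3.0 * math.pi * radius_cm ** 3
    return volume_cm3 * density_g_cm3 / 1000.0


if __name__ == "__main__":
    # Illustrative inputs (assumptions): nu = 2.2, sigma_f = 1.5 b,
    # sigma_t = 4 b, density 19 g/cm^3.
    r_c = critical_radius_cm(2.2, 1.5, 4.0, 19.0)
    print(f"critical radius: {r_c:.1f} cm")
    print(f"critical mass:   {sphere_mass_kg(r_c, 19.0):.0f} kg")
```

With these inputs the result lands in the same range as the numbers quoted in
the box, a radius of somewhat more than 13 cm and a mass of order 200 kg. As
the box points out, the diffusion approximation overestimates the true value
because the critical size is not large compared to the diffusion length.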
could never be, the equal of science in a democracy.” The conclusion that
totalitarian regimes are unable to organise a coherent scientific effort, even
for military purposes, unfortunately turned out to be wrong. Shortly after his
book first appeared
in 1947, the Soviet Union conducted its first test of a nuclear bomb. 


6.4 THE MANHATTAN ENGINEER DISTRICT 


Landsman summarises the conditions which had to be satisfied for a successful 
crash program to develop and produce a nuclear bomb [535]: 


1. There was a strong initial drive by a small group of physicists to get the 
program off the ground. 


2. From a certain point in time, there was unconditional support from the
Government. 


3. Practically unlimited industrial resources and man power were available. 


4. There was an unprecedented concentration of brilliant scientists working 
on the project. 


In Germany the first condition was met, but none of the others were. Heisen- 
berg always emphasised the lack of conditions 2 and 3. In addition, the last 
condition could not have been satisfied because of the massive forced exodus of 
first-class physicists from Germany and occupied countries. By contrast,
all conditions were clearly met in the U.S. in the context of what is called
the Manhattan Project⁵, a massive secret effort to develop nuclear weapons with
the official code name “Manhattan Engineer District”, soberly described in the
report “Atomic Energy for Military Purposes” by Henry DeWolf Smyth [305].

Allied efforts to use nuclear energy for weaponry started on both sides of 
the Atlantic shortly after the discovery of nuclear fission. Just before the first 
hostilities between Germany and Poland began in September 1939, the Hungarian
émigré Leo Szilard and Albert Einstein wrote a letter to President Roosevelt⁶
alerting him to the possibility that Germany would use nuclear fission
for military purposes. President Roosevelt created a governmental committee 
coordinating and modestly funding nuclear research, both on a reactor and on 
explosives. In 1940, Alfred Nier from Minnesota and his colleagues at Columbia 
verified the prediction by Bohr and Wheeler that the isotope ²³⁵U was responsible
for fission with slow neutrons [290, 291]. They concluded [291]: “These experiments
emphasize the importance of uranium isotope separation on a larger scale
for the investigation of chain reaction possibilities in uranium.” 


⁵ The chronology of the Manhattan Project and many primary documents
can be found on the websites of the U.S. Department of Energy,
https://www.osti.gov/opennet/manhattan-project-history/, and the National Security
Archive, https://nsarchive2.gwu.edu//NSAEBB/NSAEBB162/index.htm

⁶ https://www.osti.gov/opennet/manhattan-project-history/Resources/einstein.
letter_photograph.htm#1
In Britain, Marcus Oliphant in Birmingham had attracted the émigrés
Rudolf Peierls and Otto Frisch to his university. Alarmed by the possibility
that Germany would make an effort towards military use of nuclear fission,
Frisch and Peierls wrote a two-part paper, a technical and a non-technical one,
collectively called the “Frisch-Peierls Memorandum”⁷, to explain the physics
and rough parameters of a nuclear bomb. The original is kept in the Bodleian
Library of Oxford University. Due to only approximately known properties of
uranium, they came up with an overly optimistic estimate of the critical mass,
roughly one pound. The memorandum made it into governmental channels 
and an advisory committee, called MAUD, was formed to coordinate research 
on nuclear explosives at British universities. Experimental data confirmed the 
memorandum’s punch line and convinced sceptics like James Chadwick that a 
nuclear bomb was feasible. The final report of the MAUD committee in 1941 
had a considerable impact on both sides of the Atlantic [614, Section 3.7]. On 
the initiative of Gregory Breit, the National Research Council set up a self- 
censorship committee in 1940, to withhold publications on nuclear issues until 
after the war [429]. All relevant research papers were only published after the 
war, but quoting their original submission date. 

The Japanese attack on the U.S. fleet in Pearl Harbor on December 7,
1941, catapulted the United States into the war. The following month Presi- 
dent Roosevelt secretly gave his tentative approval to proceed with the con- 
struction of a nuclear bomb. By early 1942, a series of military defeats in the 
Pacific caused the pace to be accelerated, following two lines of research and 
development, a uranium and a plutonium line. A bomb made of pure ²³⁵U
required isotope separation at an industrial level. Enrichment seemed feasible 
by gaseous diffusion through fine meshes, which Frisch and Peierls had already 
envisaged; by gas centrifuge; or by spectroscopic separation. A bomb made of 
plutonium seemed a quicker route, since ²³⁹Pu can be chemically separated
from other elements. However, it must be bred in a nuclear reactor by the
reaction chain ²³⁸U + n → ²³⁹U → ²³⁹Np + e⁻ + ν̄ₑ → ²³⁹Pu + 2e⁻ + 2ν̄ₑ. The two
beta decays have half lives of 23 minutes and 2 days, respectively. Plutonium itself
has a long lifetime, more than 24,000 years. This line of research was led by 
Arthur Compton at the new Metallurgical Laboratory (Met Lab) at the Uni- 
versity of Chicago, today the Argonne National Laboratory. Enrico Fermi’s 
work at Columbia and Glenn T. Seaborg’s work at Berkeley were moved to
Chicago. A review in May 1942 found that progress had been made on all 
paths to uranium enrichment as well as plutonium breeding. However, there 
was no clear front-runner. Since there was no time to be lost if the bomb 
was to be available for war-time usage and since there was no room for wrong 
decisions, it was decided to follow all paths in parallel. At the same time, large 
parts of the project were put in the hands of the Army Corps of Engineers, 
which set up the Manhattan Engineer District (MED) to manage the project, 


7See transcripts in http://www.atomicarchive.com/Docs/Begin/FrischPeierls.shtml 
and http://www.atomicarchive.com/Docs/Begin/FrischPeierls2.shtml 
under the leadership of General Leslie R. Groves. A site at Oak Ridge, Tennessee, 
was selected for the construction of production plants, concentrating 
on spectroscopic separation and gaseous diffusion and dropping the centrifuge process. J. 
Robert Oppenheimer was appointed head of the bomb research and design lab- 
oratory to be built at Los Alamos, New Mexico. In December 1942, President 
Franklin Roosevelt gave his final authorisation to proceed with construction 
of the nuclear bomb. 
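
The time scales of the plutonium breeding chain quoted above can be made concrete with a small numerical sketch. It assumes the rounded half-lives given in the text (23 minutes and 2 days), neglects the much slower decay of plutonium itself, and uses the standard decay law for a two-step chain. The function name `chain_fractions` is purely illustrative.

```python
import math

# Half-lives as rounded in the text (assumed inputs, not precise nuclear data)
T_HALF_U239_MIN = 23.0              # 239U -> 239Np, in minutes
T_HALF_NP239_MIN = 2.0 * 24 * 60    # 239Np -> 239Pu, in minutes


def chain_fractions(t_min, t_half_a=T_HALF_U239_MIN, t_half_b=T_HALF_NP239_MIN):
    """Return the fractions (U, Np, Pu) of an initial 239U sample after t_min minutes.

    The decay of 239Pu (half-life above 24,000 years) is neglected.
    """
    lam_a = math.log(2) / t_half_a
    lam_b = math.log(2) / t_half_b
    n_u = math.exp(-lam_a * t_min)
    # Two-step decay chain: population of the intermediate nucleus
    n_np = lam_a / (lam_b - lam_a) * (math.exp(-lam_a * t_min) - math.exp(-lam_b * t_min))
    n_pu = 1.0 - n_u - n_np
    return n_u, n_np, n_pu


if __name__ == "__main__":
    for days in (1, 5, 10):
        u, np_, pu = chain_fractions(days * 24 * 60)
        print(f"after {days:2d} d: U {u:.4f}  Np {np_:.4f}  Pu {pu:.4f}")
```

With these rounded inputs, practically all of the $^{239}$U has decayed within a few hours, and about 97% of the initial sample has become plutonium after ten days.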

On December 2, 1942, researchers in Chicago headed by Enrico Fermi 
achieved the first self-sustaining chain reaction in a graphite-moderated uranium 
reactor. Based on this success, the DuPont company constructed an experimental 
production reactor and a pilot chemical separation facility at Oak 
Ridge. Met Lab scientists designed the water-cooled plutonium production 
reactors for the Hanford site in the U.S. state of Washington. Starting in 1943, 
three production reactors and corresponding chemical separation plants were 
built, with the first one operational in late September 1944. Los Alamos received 
its first plutonium from Hanford in early February 1945. Neutrinos produced by 
a Hanford reactor were used by Frederic Reines and Clyde Cowan to directly demonstrate 
the existence of the electron neutrino $\nu_e$ in 1953 (see Section 9.1). 

The isotope separation at Oak Ridge went into production rather slowly 
due to the immense technical difficulties with both the electromagnetic and the 
gaseous diffusion techniques at industrial scale. Both were eventually 
successful, as well as the liquid thermal diffusion technique brought in as a 
back-up. In the end, it took the combined efforts of all three facilities to pro- 
duce enough enriched uranium for the one and only uranium bomb produced 
during the war. 

Meanwhile at the Los Alamos site, close to Santa Fe in New Mexico, scien- 
tists and engineers were busy turning theory into a viable bomb design. New- 
comers were given a series of lectures by Robert Serber and Oppenheimer [299]. 
The problem is how to almost instantly bring together sub-critical pieces of 
fissionable material to exceed the critical mass. Efforts for the uranium bomb, 
led by Seth Neddermeyer, concentrated on the rather simple gun-type design, 
where a piece of uranium is shot into a uranium target by ordinary explosive. 
Plutonium, on the other hand, is likely to melt before the two pieces could 
come together. Von Neumann proposed an alternative ignition method, where 
the fissionable material is compressed by a surrounding explosive to reach crit- 
ical mass density. The engineering for this method is much more complex and 
required a test of the technology. 

In June 1944, Allied troops landed in France and the war in Europe entered 
its final phase. In parallel, the Alsos mission had determined that Germany 
was nowhere near mastering nuclear weapons technology. Germany as 
a target for a nuclear bomb thus ceased to be of primary interest and Japan 
came into focus, with the aim to end the war in the Pacific. In the first months 
of 1945, the facilities at Oak Ridge and Hanford produced enough enriched 
uranium and enough plutonium for at least one bomb each. At Los Alamos, 
bomb designs were finalised and ready for testing. 

With the construction of a nuclear bomb within reach, a debate started 
on whether and how to use it. In spring of 1945, the Secretary of War of 
the Truman administration formed a so-called Interim Committee to examine 
the question. It was composed of representatives of the administration, the 
military, industry and science. Robert Oppenheimer, Enrico Fermi, Arthur 
Compton, and Ernest Lawrence served as scientific advisors. The debate 
revolved around the question whether it was a sufficient threat to Japan to 
announce and launch a demonstration bomb on uninhabited territory, or if 
a target in Japan was to be hit in an unannounced attack. The aim was to 
stop the war in the Pacific either way. The committee rejected the first alter- 
native as too dangerous, because of the possibility of enemy preparation and 
intervention, but also the risk that the demonstration could fail. It was thus 
decided to recommend the launch on a military complex surrounded by work- 
ers’ homes in a Japanese city. The committee also recommended that after 
the war ended, nuclear technology should be controlled by an international 
organisation, which would have the right to inspect installations on-site. In the 
absence of such an organisation, it advocated trying to preserve American 
superiority as long as possible. The conclusions were challenged by scientists 
at Met Lab, including James Franck, Glenn Seaborg and Leo Szilard. They 
argued that a preemptive strike on Japan without a prior threatening demon- 
stration would inevitably cause an arms race and that international control of 
the proliferation of nuclear arms was of paramount importance. However, the 
Interim Committee stuck to their point of view, which was communicated to 
President Truman during the Potsdam conference. 

To verify the technology of implosion ignition required for a plutonium 
bomb, a test was conducted in mid-July 1945 in the New Mexico desert, under 
the code name “Trinity”. It was attended by several hundred people, includ- 
ing Leslie Groves, James Chadwick, Enrico Fermi, Richard Feynman, Robert 
Oppenheimer and John von Neumann. The test was preceded by exploding a 
demonstration device for the implosion mechanism, with almost 100 t of conventional 
explosive mixed with radioactive material. In the early morning of 
July 16, the full nuclear device was fired. A photograph taken from 20 miles 
away is shown in Figure 6.1. Richard Feynman witnessed the test from behind 
a truck windshield [438]. He remembered⁸: 


So I look back up, and I see this white light changing into yellow 
and then into orange. The clouds form and then they disappear 
again; the compression and the expansion forms and makes clouds 
disappear. Then finally a big ball of orange, the center that was 
so bright, becomes a ball of orange that starts to rise and billow a 
little bit and get a little black around the edges, and then you see 
it’s a big ball of smoke with flashes on the inside of the fire going 
out, the heat. 


8The quote comes from a transcript of Feynman’s lecture at Santa Barbara in 1975, 
which you can listen to at https://www.youtube.com/watch?v=hTRVLUT665U 
All this took about one minute. It was a series from bright 
to dark, and I had seen it. I am about the only guy who actually 
looked at the damn thing, the first Trinity test. Everybody else had 
dark glasses, and the people at six miles couldn’t see it because 
they were all told to lie on the floor. I’m probably the only guy 
who saw it with the human eye. 

Finally, after about a minute and a half, there’s suddenly a 
tremendous noise - BANG, and then a rumble, like thunder — and 
that’s what convinced me. Nobody had said a word during this 
whole thing. We were all just watching quietly. But this sound 
released everybody — released me particularly because the solidity 
of the sound at that distance meant that it had really worked. 


Figure 6.1 Amateur photograph of the 
Trinity test explosion of the first nuclear 
bomb on July 16, 1945. (Credit: Photo 
courtesy of Los Alamos National Labo- 
ratory) 


President Truman was promptly informed of the successful test and used 
this new strengthening of the U.S. position in the negotiations about a post- 
war order for Europe. He did not know that the Soviet Union had had information 
about the U.S. program since 1941 and was well aware of the situation. The 
final order for a nuclear attack on Japan was given on July 25, in a directive 
drafted by Leslie Groves. It ordered the Army Air Force to attack Hiroshima, 
Kokura, Niigata, or Nagasaki (in that order of preference) as soon after August 
3 as weather permitted. 

The first bomb, a uranium gun-type nicknamed “Little Boy”, was dropped 
on Hiroshima on August 6, 1945. It detonated half a kilometre above a 
Japanese army training field. It contained little more than the critical mass of 
$^{235}$U, with an effective explosive power of 15 kt of TNT, i.e. 63 TJ. This 
actual yield was only 3% of the theoretical energy yield; most of the $^{235}$U 
was dispersed in the explosion before contributing to the chain reaction. The 
bomb caused widespread death and destruction in the densely populated city 
of Hiroshima. An estimated 70,000 people died as a result of initial blast, heat, 
and radiation effects⁹. By the end of the year, some 30,000 more were killed 
by radioactive fallout and other collateral effects. There are no official statistics 
on the long-term death toll due e.g. to increased probability of cancer, but it 
may amount to similar numbers. 

The second launch, with a plutonium implosion-type bomb nicknamed 
“Fat Man”, took place over Nagasaki on August 9, 1945. The target had been 
chosen by the aircraft commander because of technical difficulties and bad 
weather, even though it was low on the pre-established list of targets. The 
bomb again exploded half a kilometre above the city, midway between 
two armaments plants, the Mitsubishi-Urakami Torpedo Works and the Mitsubishi Steel 
and Arms Works. Despite a partial evacuation, there were almost 200,000 peo- 
ple living in Nagasaki. The estimated yield was 21 kt of TNT, thus 40% larger 
than that of “Little Boy”. Because of the topography of the area and a point of 
detonation away from the city centre, the destruction was less than at Hiroshima, but 
nevertheless substantial¹⁰. About 40,000 people died immediately, 70,000 within 
a year and twice as many within the next five years. In the area immediately 
under the detonation, the death rate was comparable to that at Hiroshima. 
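
The yields quoted for the two bombs can be cross-checked with a short sketch. It assumes the conventional definition of one ton of TNT as 4.184 GJ, which is not stated in the text; the helper name `kt_to_terajoule` is illustrative only.

```python
# Conventional energy equivalent of one ton of TNT (assumed definition)
TNT_JOULE_PER_TON = 4.184e9


def kt_to_terajoule(kilotons):
    """Convert a yield in kilotons of TNT into terajoules."""
    return kilotons * 1e3 * TNT_JOULE_PER_TON / 1e12


little_boy_tj = kt_to_terajoule(15.0)   # uranium bomb, yield as quoted in the text
fat_man_tj = kt_to_terajoule(21.0)      # plutonium bomb, yield as quoted in the text
ratio = fat_man_tj / little_boy_tj

if __name__ == "__main__":
    print(f"Little Boy: {little_boy_tj:.1f} TJ")
    print(f"Fat Man:    {fat_man_tj:.1f} TJ, ratio {ratio:.2f}")
```

Under this assumption, 15 kt corresponds to the 63 TJ quoted for “Little Boy”, and the ratio of 1.4 reproduces the statement that “Fat Man” was 40% more powerful.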

As a consequence of these two attacks, Japan offered surrender to the Allies 
on August 10, provided that Emperor Hirohito be spared. On August 12, the 
U.S. accepted the surrender, on the condition that the Emperor would retain 
only a ceremonial role. World War II had ended due to nuclear power. 

Although collaboration across science, the military and industry, piloted by 
government, was first implemented in World War I, the Manhattan Project 
pushed back all previous boundaries. At its height, it employed more than 
130,000 people and cost an estimated 2 billion dollars in 1945 [360] (equivalent 
to ten times that much in today’s US$). In Los Alamos alone, it united the 
crème de la crème of nuclear physicists of the time. The DoE web site¹¹ lists 
Hans Bethe, James Chadwick, Albert Einstein, Enrico Fermi, Richard Feyn- 
man, James Franck, Klaus Fuchs (see Section 6.5), Joseph Rotblat, Glenn 
Seaborg, Robert Serber, Leo Szilard, Edward Teller, Eugene Wigner and Herbert 
York, no fewer than eight Nobel laureates. This count does not include the 
numerous other contributors at Universities and visitors from abroad like Niels 
and Aage Bohr. It is thus clear that Landsman’s fourth condition was more 
than fulfilled, a concentration of brain power that could have been equalled 
nowhere in the world. 

It is not possible here to seriously estimate the total death toll of World 
War II [586]. In Europe, many tens of millions died under the reckless attack of 
Nazi Germany, including six million Jews and other minorities, victims of the 
Holocaust. The loss of life in the Pacific war was equally horrific, especially in 
China, where an estimated 15 to 20 million people died during the Japanese 


9The death toll estimates come from the U.S. Department of Energy, 
https://www.osti.gov/opennet/manhattan-project-history/Events/1945/hiroshima.htm 
10The death toll estimates come from the U.S. Department of Energy, 
https://www.osti.gov/opennet/manhattan-project-history/Events/1945/nagasaki.htm 
11https://www.osti.gov/opennet/manhattan-project-history/People/people.htm 
occupation. The total death toll is often quoted to be of the order of 65 million 
people, with about 27 million casualties in the Soviet Union alone. Thus only 
a small but significant percentage is due to the use of nuclear weapons, which 
fortunately has so far remained the only one. 


6.5 COLD WAR PHYSICS 


Tensions between the allies against the Nazi regime appeared immediately after 
the end of World War II¹². The strict security measures in the Manhattan 
Project succeeded in keeping the bomb program from being spied on by German 
and Japanese enemy intelligence, such that only rumours leaked. Soviet 
intelligence, however, was more successful, probably also because of people 
with sympathy for communism in Britain and the U.S. The Soviet govern- 
ment learned about the nuclear bomb plans already in 1941, a year ahead of 
the formation of the Manhattan Engineer District. The information allegedly 
came from within the so-called “Cambridge Five” spy ring in Britain, whose 
members were Guy Burgess, Donald Maclean, Kim Philby, Anthony Blunt and 
John Cairncross¹³. At the same time the sudden drop in American and British 
publications on nuclear research caught the attention of Soviet physicists, but 
interestingly enough not of German ones, probably since they thought they were 
far ahead anyway. Soviet intelligence made intensive attempts to penetrate 
key laboratories like the University of California at Berkeley and the Chicago 
Met Lab, attempts mostly stopped by FBI and internal counterintelligence. 
However, one central figure, the British physicist Klaus Fuchs, made it into the 
heart of the Manhattan Engineer District, working with the Theoretical Divi- 
sion at Los Alamos. After returning home he continued to inform the Soviet 
Union about British bomb research and development. After being uncovered 
in 1950, he served nine years in prison in the U.K., moved to East Germany 
and became one of the GDR’s scientific leaders. Additional engineering infor- 
mation was funnelled to Soviet channels by Julius and Ethel Rosenberg, née 
Greenglass, and her brother David Greenglass. The Rosenbergs were executed 
in 1953 despite an international campaign in their favour. 

Notable scientist spies in the Canadian part of the Manhattan Project 
were Alan Nunn May, a collaborator of Chadwick, and Bruno Pontecorvo from 
Enrico Fermi’s group. Both worked on the water-moderated nuclear reactor 
at Chalk River, Ontario, and returned to Britain after the war. Following 
the arrest of Klaus Fuchs, Pontecorvo’s Soviet handlers became worried that 
he would be uncovered, and in 1950 Pontecorvo covertly went to the Soviet 
Union with his family. Pontecorvo continued his work as a physicist, and 
made important contributions e.g. to neutrino physics (see Section 9.1), all 


12Many facts and figures in this Section come from the Atomic Archive series of web 
pages, http://www.atomicarchive.com/History/coldwar/index.shtml and Odd Arne Wes- 
tad’s seminal work [646]. 

13For an interesting making-of the Cambridge Five, see the book by Davenport [651], 
who identifies British societal tendencies of the time which according to him led to the 
long-term rise of populism and Brexit. See also the review of his book by John Banville [663]. 
the while continuing to deny that he had been a spy. Many other sources in the 
Manhattan Project remained known only by their codenames from intercepted 
messages. 

In 1946, the U.S. government made a proposal to put all nuclear energy 
activities under international control, including on-site inspections, called the 
Baruch plan after its author¹⁴, and based on a comprehensive report by Secretary 
of State Dean Acheson and David Lilienthal [306]. The plan was promptly 
rejected by the Soviet Union and not adopted by the United Nations either, 
just like the Soviet counter-proposal of universal nuclear disarmament. The 
U.S. resumed nuclear bomb testing immediately afterwards, from 1947 onwards under 
the civilian authority of the Atomic Energy Commission (AEC) [360]. The 
AEC was originally chaired by Lilienthal, and scientifically advised by a subcommittee 
headed by Oppenheimer. Both men notably opposed the development 
of a nuclear fusion weapon, popularly known as a hydrogen or thermonuclear 
bomb, leading to their removal from the AEC. 

The first two nuclear tests after World War II, code-named Crossroads, 
were conducted at the Bikini Atoll in the Pacific, using plutonium fission 
bombs of the Nagasaki type. One bomb detonated over water, the other under 
water, with derelict battleships as test targets. They were widely publicised as 
yet another demonstration of the unprecedented power of nuclear armament 
and U.S. dominance. 

Meanwhile the Soviet effort to gain access to this new weapon had been 
jump-started by the successful intelligence during the Manhattan Project. The 
project thus progressed rapidly under the leadership of Igor V. Kurchatov¹⁵. 
Following detailed information obtained through Klaus Fuchs, a copy of the 
Fat Man design, nicknamed Joe-1 in the West, exploded at the Semipalatinsk 
test site in Kazakhstan at the end of August 1949. The arms race predicted by the 
Met Lab scientists was on. But no other Soviet test followed for two years. 

During the Manhattan Project, Edward Teller had already come for- 
ward with the idea to use nuclear fusion for military purposes. Bombs using 
nuclear fusion are also called thermonuclear weapons or (misleadingly) hydro- 
gen bombs. Given the quick success of the Soviets with a fission bomb, Teller’s 
idea gained momentum. The reasons are threefold. First, the energy gain per 
nucleon in the fusion of light elements into e.g. the doubly magic $^4$He is even 
greater than that of fission, as shown in Focus Box 6.4. Second, the raw materi- 
als for nuclear fusion, like deuterium and tritium, are relatively abundant and 
readily available in normal water. Third, in contrast to fission bombs, where 
there is a lower and an upper limit on the fissionable mass, the total energy 
yield in a fusion bomb is scalable: the more nuclei are fused, the more energy 
is released. The proposal to pursue this path met substantial resistance, not 
only by the AEC but also by individual Nobel laureates like Fermi and Isidor 
I. Rabi, who criticised the plan as immoral and technically unfeasible [360, 


14http://www.atomicarchive.com/Docs/Deterrence/BaruchPlan.shtml 
15The Russian National Research Center for nuclear and particle physics is named after 
Igor Kurchatov, see http://eng.nrcki.ru 

When you look at the graph of the binding energy per nucleon in Figure 5.7, 
you notice that the rise at low masses is much steeper than the slow decrease 
at large masses. This steep rise is used by nuclear fusion in stars to liberate 
binding energy. The yield per nucleon is larger than the gain in fission, but 
since the nuclei have few constituents, the gain per nucleus is smaller. Light 
nuclei are, however, readily available as raw material. 

To fuse two nuclei, Coulomb repulsion must be overcome. The maximum 
potential $V_C$, when two nuclei $(A, Z)$ and $(A', Z')$ just touch, is of the order 
of a few MeV: 

$$V_C = \frac{Z Z' e^2}{4\pi\epsilon_0 r_0}\,\frac{1}{A^{1/3} + A'^{1/3}}$$

Here $r_0 A^{1/3}$, with $r_0 \simeq 1.2$ fm, is the nuclear radius. To overcome 
the barrier by thermal motion of the nuclei, a substantial density and a 
very high temperature, $T = V_C/k \sim 10^{10}$ K, is required. Typical temperatures 
in the interior of the Sun are more like $10^7$ K, but the tail of the Maxwell 
velocity distribution is sufficiently long to allow fusion. Raw material in 
the form of hydrogen plasma is abundantly available. The principal energy 
source for light stars is the so-called proton-proton chain: 


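$$\begin{aligned}
{}^1\mathrm{H} + {}^1\mathrm{H} &\to {}^2\mathrm{H} + e^+ + \nu_e + 0.42\,\mathrm{MeV} \\
{}^1\mathrm{H} + {}^2\mathrm{H} &\to {}^3\mathrm{He} + \gamma + 5.49\,\mathrm{MeV} \\
{}^3\mathrm{He} + {}^3\mathrm{He} &\to {}^4\mathrm{He} + 2\,{}^1\mathrm{H} + 12.86\,\mathrm{MeV}
\end{aligned}$$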


The large energy liberated in the last step is due to the exceptionally high 
binding energy of the doubly magic $^4$He nucleus. The total process burns four 
hydrogen nuclei and generates almost 25 MeV energy. The positron produced 
on the way annihilates with plasma electrons and contributes to the energy 
balance. So do the gamma rays. The Sun has enough fuel to shine for another 
$\sim 10^9$ years. A second important source of energy for heavy stars is the CNO 
cycle. At still higher temperatures, helium burns in the triple-alpha process, 
which can be summarised as $3\,({}^4\mathrm{He}) \to {}^{12}\mathrm{C} + 7.27$ MeV. Heavier elements 
are synthesised by fusing additional helium nuclei, those beyond iron by capturing 
neutrons, e.g. in the merger of binary objects. 
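
Two of the numbers in this box can be checked with a few lines of Python. The sketch below evaluates the Coulomb barrier for two touching protons and the corresponding temperature, then adds up the energy released in the proton-proton chain. The radius parameter of 1.2 fm, the value $e^2/4\pi\epsilon_0 \simeq 1.44$ MeV fm and the Boltzmann constant are standard values assumed here, and all function names are illustrative.

```python
# Assumed standard constants (not quoted in the text)
E2_OVER_4PI_EPS0 = 1.44      # MeV fm
R0_FM = 1.2                  # fm, nuclear radius parameter, R = R0 * A**(1/3)
K_BOLTZMANN = 8.617e-11      # MeV per kelvin


def coulomb_barrier_mev(z1, a1, z2, a2, r0=R0_FM):
    """Coulomb potential of two nuclei (A, Z) and (A', Z') that just touch."""
    return E2_OVER_4PI_EPS0 * z1 * z2 / (r0 * (a1 ** (1 / 3) + a2 ** (1 / 3)))


def barrier_temperature_k(v_mev):
    """Temperature T = V_C / k at which thermal energies reach the barrier."""
    return v_mev / K_BOLTZMANN


def pp_chain_energy_mev():
    """Energy of the chain: the first two steps run twice per final 3He + 3He step."""
    return 2 * (0.42 + 5.49) + 12.86


if __name__ == "__main__":
    v_pp = coulomb_barrier_mev(1, 1, 1, 1)
    print(f"barrier for two protons: {v_pp:.2f} MeV")
    print(f"corresponding temperature: {barrier_temperature_k(v_pp):.1e} K")
    print(f"proton-proton chain: {pp_chain_energy_mev():.2f} MeV")
```

For two protons the barrier comes out at 0.6 MeV, i.e. a temperature of almost $10^{10}$ K, and the chain releases 24.68 MeV before the annihilation of the positrons is counted, consistent with the "almost 25 MeV" quoted above.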

When constructing a nuclear fusion bomb, similar conditions of density and 
temperature must be reached. There is thus usually a two-stage process: a 
fission detonator, itself an implosion-type uranium or plutonium bomb, generates 
the pressure and temperature to cause a pellet of deuterium and tritium 
to fuse via the reaction ${}^2\mathrm{H} + {}^3\mathrm{H} \to {}^4\mathrm{He} + n + 17.6$ MeV. The pellet usually 
consists of lithium deuteride, $^6$LiD, which can supply both the deuterium 
by thermal dissociation and the tritium by neutron-induced fission of the lithium. 
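
The energy released in the deuterium-tritium reaction follows from the mass difference between initial and final state. The following sketch assumes rounded literature values for the atomic masses and for the conversion factor between atomic mass units and MeV; none of these numbers appear in the text, and the function name is illustrative.

```python
U_TO_MEV = 931.494   # MeV per atomic mass unit (assumed standard value)

# Atomic masses in u, rounded literature values (assumed inputs)
M_DEUTERIUM = 2.014102
M_TRITIUM = 3.016049
M_HELIUM4 = 4.002603
M_NEUTRON = 1.008665


def q_value_mev(initial_masses, final_masses):
    """Energy release of a reaction from the mass defect, in MeV."""
    return (sum(initial_masses) - sum(final_masses)) * U_TO_MEV


q_dt = q_value_mev([M_DEUTERIUM, M_TRITIUM], [M_HELIUM4, M_NEUTRON])

if __name__ == "__main__":
    print(f"Q(D + T -> 4He + n) = {q_dt:.2f} MeV")
```

Atomic rather than nuclear masses can be used because the number of electrons is the same on both sides of the reaction. The result agrees with the 17.6 MeV quoted above to within the rounding of the inputs.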

For several decades, there has been a world-wide effort to use controlled nuclear 
fusion for energy production. The latest example is the ITER project [689] 
under construction in the south of France. It will use magnetic confinement 
in a Tokamak to contain a plasma in its own magnetic field and an external 
toroidal one. Another confinement method is the Stellarator [504], which uses 
a more complex magnetic field and relies less on the dynamics of the plasma 
itself. A German-American demonstrator project is the Wendelstein 7-X. 


Focus Box 6.4: Nuclear fusion 

p. 380ff]. However, in 1950 President Truman decided that the development 
of a fusion bomb was to go ahead in view of the intensifying cold war. The 
opponents Lilienthal and Oppenheimer were subsequently removed from the 
AEC. 

At the end of 1952, a first fusion bomb constructed according to the two-stage 
design by Teller and Ulam was detonated on an uninhabited atoll of 
the Marshall Islands, with a stunning yield of more than 10 megatons TNT 
equivalent, about $42 \times 10^{15}$ J. The explosion wiped out the island housing 
the installation; its mushroom-shaped cloud was more than 150 km wide and 
reached an altitude of 40 km. The device was not a transportable bomb, 
since it used liquid deuterium at −250 °C as fusion material and required a cooling 
plant. It was ignited by a fission bomb, creating the necessary temperature 
and pressure conditions for nuclear fusion. Subsequent models used a similar 
two-stage design, but much more compact, with solid lithium deuteride as fuel. 
The second test in spring 1954 on the Bikini atoll, with a yield of almost 16 
megatons TNT equivalent, was the most powerful U.S. bomb ever detonated. 
It left a giant crater and blasted millions of tons of radioactive debris into 
the atmosphere. The inhabitants of neighbouring islands and the crew of a 
Japanese fishing boat suffered from severe exposure to radiation, which caused 
short-term radiation sickness and long-term effects like thyroid cancer. 

The Soviet answer did not take long. Based on initial information from 
Klaus Fuchs, Andrei Sakharov designed a fusion bomb with layers of fusion 
fuel and uranium. In summer of 1953, a test of such a device in Siberia yielded 
about 400 kt TNT equivalent energy. However, the design was not really scalable 
and was thus abandoned in favour of a two-stage design à la Teller-Ulam. In 
November 1955, the first Soviet megaton bomb exploded at the Semipalatinsk 
test site. In a series of further tests, a maximum yield of almost 60 megatons 
was achieved in 1961. 

Meanwhile, the U.S. was under the influence of anti-communist paranoia 
during the McCarthy era. In 1953, Oppenheimer saw his security clearance 
suspended by the AEC, on a charge of long-term association with the Communist 
Party. He demanded a security hearing in front of the AEC, which lasted 
several weeks and heard 40 witnesses [537]. General Groves, Bethe and 
Rabi testified in his favour, whereas Teller’s testimony went against him. Nevertheless, the majority of the AEC, with the 
notable exception of Smyth, did not recommend that he be given further access 
to sensitive information. After this devastating decision, Oppenheimer retired 
from public life and continued his research as the director of the prestigious 
Institute for Advanced Study in Princeton. 

Nuclear weapons were not limited to the U.S. and the Soviet Union for 
long. Britain followed in 1952 with a fission bomb explosion in Australia, in 
1957 with a fusion bomb test over Christmas Island in the Pacific. France 
followed in 1960 with a fission test in Algeria, and in 1968 with a fusion bomb 
in the South Pacific, and continued atmospheric testing until 1974. In 1995, 
France even resumed nuclear testing under global protest. However, France 
has now joined the other major nuclear powers in ratifying the Comprehensive 
Nuclear Test Ban Treaty (CTBT)¹⁶ and the Treaty on Non-Proliferation of 
Nuclear Weapons (NPT)¹⁷. 

Despite all efforts to prevent further proliferation of nuclear weapons, 
China, India, Pakistan and lately North Korea joined the nations officially 
claiming the possession of nuclear armament. Israel is suspected of having a 
covert nuclear weapons program but does not acknowledge it. Other countries 
in the Middle East like Iran are regularly accused of aiming to produce 
highly enriched $^{235}$U for military purposes. The Cold War strategy of Mutually 
Assured Destruction (MAD), which strongly deterred first use of nuclear 
armament by guaranteeing that nuclear retaliation would lead to total destruc- 
tion of the aggressor, led to an unimaginable nuclear arsenal. That the MAD 
strategy worked was convincingly demonstrated in the Cuban missile crisis. 
Starting in 1962 a large number of Soviet military personnel arrived on the 
island to install nuclear ballistic missile sites, promptly detected by U-2 spy 
planes. The largest rockets installed had a range of 1200 miles, enough to 
reach large cities in the south and east of the U.S. The threat was countered 
by the Kennedy administration with a naval blockade on October 22, 1962. 
The U.S. made clear it would not allow further delivery of Soviet arms to an 
island 90 miles off its coast and demanded that all weapons already in place 
be dismantled and brought back to the Soviet Union. Through threat and 
intense parallel diplomacy, an agreement was reached and the crisis resolved 
13 days later [646, p. 305 ff]. The whole world felt that it had been on the 
brink of a nuclear catastrophe. My parents, like many Germans encouraged 
by the Adenauer government, stockpiled water and food according to a list 
distributed by civil defence, in the basement that had already served as a 
makeshift shelter in World War II. Switzerland continues to maintain a whole 
system of public bomb shelters beyond the end of the Cold War¹⁸ in 1989. 

The nuclear arms threat did not stop. For 2018, the Stockholm Inter- 
national Peace Research Institute reported a total of “approximately 13,865 
nuclear weapons, of which 3750 were deployed with operational forces” [674]. 
Proliferation control is made difficult by the dual use of enriched uranium. 
Nuclear power plants need uranium enriched in $^{235}$U to about 3 or 4%. There 
are about 450 of those operating today, a number roughly constant since 1990; 
they supply about 10% of the world’s electricity¹⁹. Despite three major 
nuclear accidents with core meltdown [632], nuclear energy remains an important 
component of the energy budget in many countries. The NPT thus recognises 
the “inalienable right” of all nations to develop nuclear technology for peace- 
ful purposes. But the technology that enriches uranium for power plants is of 
course also capable of producing weapon-grade fissionable material, making 


16https://www.ctbto.org/fileadmin/content/treaty/treaty_text.pdf 

17https://www.un.org/disarmament/wmd/nuclear/npt/ 

18For Westad’s perception of the end of the cold war, see [647]. 

19https://www.world-nuclear.org/information-library/current-and-future-generation/nuclear-power-in-the-world-today.aspx 
uranium enrichment the Achilles’ heel of non-proliferation. The International 
Atomic Energy Agency (IAEA) in Vienna does its best to “establish and 
administer safeguards”²⁰ preventing military use of fissionable material and 
other nuclear technology. 

CHAPTER 7 


Quantum fields 


The third aspect of my subject is that of science as a method of find- 
ing things out. This method is based on the principle that observation is 
the judge of whether something is so or not. All other aspects and char- 
acteristics of science can be understood directly when we understand 
that observation is the ultimate and final judge of the truth of an idea. 
But “prove” used in this way really means “test,” in the same way that 
a hundred-proof alcohol is a test of the alcohol, and for people today 
the idea really should be translated as, “The exception tests the rule.” 
Or, put another way, “The exception proves that the rule is wrong.” 
That is the principle of science. If there is an exception to any rule, and 
if it can be proved by observation, that rule is wrong. 


Richard P. Feynman, The Meaning of It All, 1999 [512] 


IN THIS CHAPTER I describe how Quantum Field Theory puts together 
quantum mechanics and relativity. The shift from atomic states and transitions 
towards particle reactions is an important aspect of this development. 
I will thus also discuss what quantum fields are, how they move and inter- 
act. Biographies of the prominent protagonists, Max Born, Enrico Fermi and 
Richard P. Feynman, are quoted in the Further Reading section. 


7.1 LIMITS 


Quantum mechanics certainly has philosophical problems which we glossed 
over in Section 5.6. But more important are the physical limits of its applica- 
tions. Its equation of motion, the Schrödinger equation (see Focus Box 5.9), 
is clearly not covariant under Lorentz transformations and cannot be used to 
describe systems in relativistic motion. The need for a relativistic extension 
is thus evident. The equation’s second problem is that it does not allow the 
creation or disappearance of particles. There is a continuity equation, which 
describes the fact that the probability density in an arbitrary volume can only 
change when probability “flows” in or out through the volume’s boundaries. 
We demonstrate this fact in Focus Box 7.1. This means that the number of 
particles in a box can only change when particles enter or leave the box. The 
total number of particles in a system is conserved. 

The Schrödinger equation implies the conservation of probability. We demonstrate 
this for a free particle: 

$$i\frac{\partial\psi}{\partial t} + \frac{1}{2m}\nabla^2\psi = 0$$

We multiply the equation with $\psi^*$, and its complex conjugate with $\psi$: 

$$i\psi^*\frac{\partial\psi}{\partial t} + \frac{1}{2m}\psi^*\nabla^2\psi = 0 \quad ; \quad -i\psi\frac{\partial\psi^*}{\partial t} + \frac{1}{2m}\psi\nabla^2\psi^* = 0$$

Taking the difference of the two equations, we get: 

$$\frac{\partial}{\partial t}\left(\psi^*\psi\right) - \frac{i}{2m}\nabla\left(\psi^*\nabla\psi - \psi\nabla\psi^*\right) = 0$$

The first term denotes the time variation of the probability density $\rho$, the 
second is the divergence of the probability current density $\vec j$: 

$$\rho = \psi^*\psi \quad ; \quad \vec j = -\frac{i}{2m}\left(\psi^*\nabla\psi - \psi\nabla\psi^*\right)$$

The two quantities are related by a continuity equation $\partial\rho/\partial t + \nabla\vec j = 0$. What 
it means is most evident when we consider free particles with momentum $\vec p$ 
and energy $E$. Their wave function is a plane wave: 

$$\psi = \sqrt{N}\, e^{-i(Et - \vec p\,\vec x)}$$

The probability density $\rho$ and the current density $\vec j$ of such a wave are: 

$$\rho = \psi^*\psi = N e^{i(Et - \vec p\,\vec x)}\, e^{-i(Et - \vec p\,\vec x)} = N$$

$$\vec j = -\frac{i}{2m}\left(\psi^*\nabla\psi - \psi\nabla\psi^*\right) = \frac{\vec p}{m}\, N$$

The current density is thus the product of density and velocity, in close analogy 
to the classical electromagnetic current of a particle cloud. The equation 
does not allow the number of particles to change. When their local density 
diminishes, particles have flowed out; when it increases, particles have flowed 
in. 

Focus Box 7.1: Continuity equation for non-relativistic particles 
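
The plane-wave result of Focus Box 7.1 can be checked numerically. The following is a minimal sketch, assuming one space dimension and units with $\hbar = 1$; the function names and the numerical values are illustrative choices, not taken from the text. It evaluates $\rho = \psi^*\psi$ and $j = -\frac{i}{2m}(\psi^*\partial_x\psi - \psi\partial_x\psi^*)$ with a finite-difference derivative and compares the current with density times velocity.

```python
import cmath

def plane_wave(x, t, p, m, n=1.0):
    """psi = sqrt(N) exp(-i(E t - p x)) with E = p^2 / 2m (one dimension, hbar = 1)."""
    energy = p * p / (2.0 * m)
    return n ** 0.5 * cmath.exp(-1j * (energy * t - p * x))

def density(x, t, p, m, n=1.0):
    """Probability density rho = psi* psi."""
    psi = plane_wave(x, t, p, m, n)
    return (psi.conjugate() * psi).real

def current(x, t, p, m, n=1.0, h=1e-5):
    """j = -(i / 2m) (psi* dpsi/dx - psi dpsi*/dx), with a central difference of step h."""
    psi = plane_wave(x, t, p, m, n)
    dpsi = (plane_wave(x + h, t, p, m, n) - plane_wave(x - h, t, p, m, n)) / (2.0 * h)
    return (-1j / (2.0 * m) * (psi.conjugate() * dpsi - psi * dpsi.conjugate())).real

# Illustrative values: N = 3, p = 2, m = 4, evaluated at an arbitrary point.
rho = density(0.3, 0.7, p=2.0, m=4.0, n=3.0)
j = current(0.3, 0.7, p=2.0, m=4.0, n=3.0)
print(rho, j, rho * 2.0 / 4.0)  # density, current, density times velocity p/m
```

Density and current come out independent of where and when they are evaluated, as expected for a plane wave.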
We know that this is not true, and so did Schrödinger. Photons can be
absorbed and emitted by charged particles. And even those can be created
in pairs, like electrons and positrons, and also annihilate into light. It is thus
quite possible to create particles and make them disappear, as long as the
conservation of quantum numbers like the electric charge is respected. Schrödinger
proposed to replace the particle density $\rho$ by the charge density $Q\rho$, and
the probability current density by the corresponding current density $Q\vec{j}$.
The conservation of probability then implies the conservation of electric charge $Q$.
We will meet this “trick” again in what follows, but for an even more important reason.


7.2 MOVING FIELDS 


The prototype of a relativistically moving particle is the photon. Consequently, 
the classical Maxwell equations are already covariant under Lorentz transfor- 
mations. The motion of the photon field consisting of E and B has the form 
of a wave equation, as shown in Focus Box 5.9. In analogy to this, Louis 
de Broglie [199] proposed in the beginning of the 1920s to also describe the 
motion of matter particles like the electron by a wave equation (see Sec- 
tion 5.4). Interference experiments with electrons indeed showed that matter 
also propagates in waves [219, 224]. One thus cannot maintain the classical 
concept of a trajectory (a smooth curve describing the position as a function
of time) at distances comparable to the atomic size.

It remained unclear for quite a while, how the concept of matter waves 
was to be applied to anything else but a free particle. And thus the next step 
in the development of quantum mechanics was rather the matrix concept of 
Heisenberg, Born, Jordan and Pauli in the middle of the 1920s (see Focus 
Box 5.7). Since the observation of quantum mechanical systems is only possi- 
ble via interactions delivering observables, the matrix mechanics focussed on 
radiative transitions in the atom and other interactions, not on the motion of 
particles. We come back to the focus on observables in Section 7.3. 

Erwin Schrödinger revived wave mechanics in 1926 in his famous series
of four publications [213, 214]. In the first instalment, he presented the
equation of non-relativistic motion named after him. In the fourth, he described a
relativistic version also based on considerations of energy-momentum conservation.
Schrödinger at first abandoned this route since it does not describe the
energy levels in atoms correctly, unless one introduces spin. Spin is important
here because in relativistic motion its magnetic interaction is not negligible. In
parallel, Oskar Klein [210] and Walter Gordon [209] published the same idea;
the relativistic equation of motion for scalar fields is named after them [455].
The Schrödinger equation follows from the non-relativistic energy-momentum
conservation by operator substitution, i.e. quantisation. Likewise, the
Klein-Gordon equation follows from its relativistic equivalent, as shown in Focus
Box 7.2. It is a second order partial differential equation for fields described
by a scalar wave function $\psi$. It appeared less attractive than Schrödinger’s
equation, which is linear in time, because it requires more initial conditions.
It was thus forgotten for a while until Wolfgang Pauli and Victor Weisskopf
brought it back to public attention in 1934 [262]. The equation has solutions
with positive and negative energy eigenvalues. While we are familiar with
negative potential energies from the classical physics of bound systems, negative
kinetic energies for free particles make no sense. Moreover, the signs of energy
and probability are the same, thus negative energies correspond to negative
probabilities. This makes even less sense, since probabilities are real numbers
between 0 and 1 by definition. The problem can be fixed using the same “trick”
of re-interpreting probability density as charge density à la Schrödinger. That
now links the sign of the particle charge to the sign of its energy, a very
significant fact.

The senseless negative probabilities may well have motivated the British 
theoretician Paul Dirac to search for an equation of motion linear in time. 
Since the relativistic energy conservation law has roots of negative energy 
due to its quadratic form, one may suspect that the negative probabilities in 
the Klein-Gordon equation are due to its quadratic form. A promising ansatz 
would thus be a linear wave equation analogous to Schrödinger’s, but linear in
time and space, as detailed in Focus Box 7.4. The solutions of this equation
must simultaneously obey the Klein-Gordon equation to ensure energy-momentum
conservation. For scalar fields and scalar operators this is not
possible. The operators are rather 4 × 4 matrices and the wave function has
four components. Dirac’s equation of motion for these fields thus consists of
four coupled partial differential equations for four fields
$(\psi_1, \psi_2, \psi_3, \psi_4)$. The total field $\psi$ is called a spinor¹,
since the four components correspond to states
with positive and negative energy, each with spin “up” and spin “down”. All
have the same mass, thus propagate in the same way when no force is present.

The probability density of particles propagating according to Dirac’s equation
is positive definite. The problem of negative probabilities is thus solved, and
the conservation of the electromagnetic current density is no longer obtained
by a trick, but directly follows from the conservation of the probability current.
However, since Dirac particles must also respect the Klein-Gordon equation, 
negative energies are still with us. They must be interpreted. 

In the 1930s, Dirac proposed to interpret the solution with negative energy 
as if the vacuum were filled with an infinite “sea” of electrons with negative 
energy. Normally all states with negative energy would thus be filled. If an 
energy of more than $2m_e c^2$ is transferred to such a state, it would transport an
electron from negative to positive energy. Simultaneously, a “hole” with posi- 
tive charge would be created in the vacuum, in analogy to holes created in the 
valence band of a semiconductor when an electron is lifted to the conduction 
band. This interpretation has numerous problems. Among others the electro- 
static energy of the vacuum would be infinitely large and it is unclear how the 
hole would move through the sea of electrons. Dirac’s proposal, however, has
the important merit of associating negative energies with antiparticles, for


¹ Although a Dirac spinor has four components, it is not a four-vector. Its four components
behave like scalars under Lorentz transformations.
The Klein-Gordon equation for the evolution of a scalar relativistic field follows
from relativistic energy-momentum conservation by operator substitution (see
Focus Box 5.9):

\[ E^2 - \vec{p}^{\,2} - m^2 = 0 \quad \to \quad \left(-\frac{\partial^2}{\partial t^2} + \nabla^2 - m^2\right)\psi = 0 \]

Here and in what follows, we use what is called “natural units”, explained in
Focus Box 7.3, so that the constants $\hbar$ and $c$ no longer appear. The
Klein-Gordon equation is manifestly covariant, which becomes obvious by using the
four-vector gradient $\partial_\mu = \partial/\partial x^\mu = (\partial/\partial t, \nabla)$:

\[ \left(\partial_\mu\partial^\mu + m^2\right)\psi = 0 \]

Using the same approach as for the Schrödinger equation, one derives a
continuity equation for the following probability and current density:

\[ \rho = i\left(\psi^*\frac{\partial\psi}{\partial t} - \psi\frac{\partial\psi^*}{\partial t}\right) \quad ; \quad \vec{j} = -i\left(\psi^*\nabla\psi - \psi\nabla\psi^*\right) \]

One may be surprised that the probability density now contains a derivative of
the wave function. With the probability current density it forms a four-vector
$j^\mu = (\rho, \vec{j}) = i(\psi^*\partial^\mu\psi - \psi\partial^\mu\psi^*)$,
which obeys a covariant continuity equation, $\partial_\mu j^\mu = 0$. For plane
waves, $\psi = \sqrt{N}\,e^{i(\vec{p}\cdot\vec{x} - Et)}$, the components of $j^\mu$ are:

\[ \rho = 2EN \quad ; \quad \vec{j} = 2\vec{p}N \]

The current density is proportional to the four-momentum, $j^\mu = 2p^\mu N$; its
timelike component is proportional to energy. This is due to the requirement
of relativistic invariance of the probability, $\rho\,d^3x$. After all, the number of
particles is independent of the reference frame. The volume element transforms
like $d^3x \to d^3x/\gamma$ because of Lorentz contraction in the direction of motion.
To obtain invariance, the probability density must thus transform like
$\rho \to \gamma\rho$. Since $\gamma = E/m$, $\rho$ must thus be proportional to energy.
A unit volume thus contains $2E$ particles with our normalisation.

The energy eigenvalues for a free relativistic particle are obtained when
one inserts a plane wave function into the Klein-Gordon equation:
$E = \pm(\vec{p}^{\,2} + m^2)^{1/2}$. There are thus solutions with positive and
negative energy. And since the probability density is proportional to energy, the
probability is no longer positive definite. One must re-interpret the continuity
equation as one for a four-vector of electromagnetic current density,
$j^\mu = iQ(\psi^*\partial^\mu\psi - \psi\partial^\mu\psi^*)$.
Proportional to the electric charge $Q$, $j^0$ becomes the charge density, which
can indeed be positive as well as negative. Solutions with positive and negative
energy are distinguished by the charge of the particle. And the continuity
equation describes the conservation of electric charge.


Focus Box 7.2: Klein-Gordon equation 
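
The two signs of the energy and their effect on the density can be made concrete with a few lines of code. This is a minimal sketch in natural units and one space dimension; the function names and the sample values $p = 3$, $m = 4$ are illustrative assumptions. It uses only the plane-wave result $j^\mu = 2p^\mu N$ and its charge-weighted version from the box.

```python
import math

def kg_energies(p, m):
    """Both roots of E^2 = p^2 + m^2 (natural units, hbar = c = 1)."""
    e = math.sqrt(p * p + m * m)
    return e, -e

def kg_four_current(energy, p, n=1.0):
    """Plane-wave current j^mu = 2 p^mu N, returned as (rho, j) in one space dimension."""
    return 2.0 * energy * n, 2.0 * p * n

def charge_four_current(energy, p, charge, n=1.0):
    """Charge-weighted current Q j^mu, the re-interpretation described in the box."""
    rho, j = kg_four_current(energy, p, n)
    return charge * rho, charge * j

e_pos, e_neg = kg_energies(3.0, 4.0)
print(kg_four_current(e_pos, 3.0))   # positive density for the positive root
print(kg_four_current(e_neg, 3.0))   # negative "probability" density for the negative root
# A negative-energy solution of charge -1 carries the same charge current
# as a positive-energy solution of charge +1 with reversed four-momentum.
print(charge_four_current(e_neg, -3.0, charge=-1.0))
print(charge_four_current(e_pos, 3.0, charge=+1.0))
```

The last two lines print the same pair of numbers, which is the content of linking the sign of the charge to the sign of the energy.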
Relativistic quantum systems are characterised by two natural constants,
Planck’s constant of action $\hbar$ and the speed of light $c$:

\[ \hbar = \frac{h}{2\pi} \simeq 1.055 \times 10^{-34}\,\mathrm{J\,s} \quad ; \quad c = 2.998 \times 10^{8}\,\mathrm{m/s} \]

Their dimensions are $[\hbar] = ML^2/T$ and $[c] = L/T$, where $M$ denotes mass, $L$
length and $T$ time. The natural system of units makes both constants
dimensionless and equal to unity, $\hbar = c = 1$; it defines $\hbar$ as the unit of
action and $c$ as the unit of velocity. Both constants thus disappear from all
equations. This permits e.g. to measure all masses ($M$), momenta ($Mc$) and
energies ($Mc^2$) with the same units:

\[ [E, M, p] = \frac{ML^2}{T^2} \quad ; \quad \mathrm{GeV} = 10^9\,\mathrm{eV} \simeq m_p \]

The base unit electronvolt (eV) is defined as the energy gained by a unit
charge particle traversing a potential difference of 1 V. A billion of these
units, 1 GeV, corresponds roughly to the mass of the proton and thus forms
the natural scale of high energy physics processes. When calculating ordinary
quantities, one obviously needs to convert results back into SI units. Useful
conversion constants are given in the table below.

Quantity | Conversion factor                               | Natural units      | SI units
Mass     | $1\,\mathrm{kg} = 5.61 \times 10^{26}$ GeV      | GeV                | GeV/$c^2$
Length   | $1\,\mathrm{m} = 5.07 \times 10^{15}$ GeV$^{-1}$ | GeV$^{-1}$         | $\hbar c$/GeV
Time     | $1\,\mathrm{s} = 1.52 \times 10^{24}$ GeV$^{-1}$ | GeV$^{-1}$         | $\hbar$/GeV

In the system of natural units, the electric charge is also dimensionless,
$[e] = [\sqrt{\hbar c}] = [1]$. It often appears in the form of the fine structure
constant $\alpha$, which is the electrostatic energy of two electrons at unit
distance, divided by the mass of the electron:

\[ \alpha = \frac{e^2}{4\pi\,(\hbar/m_e c)}\,\frac{1}{m_e c^2} = \frac{e^2}{4\pi\hbar c} \simeq \frac{1}{137} \]


Focus Box 7.3: Natural units 
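
The conversion factors in the table can be reproduced from the SI values of the constants. The sketch below is a minimal illustration; the function names are my own, the constants are the standard SI values, and the fine structure constant is evaluated in its SI form $e^2/(4\pi\epsilon_0\hbar c)$, which is an assumption about units that the box itself avoids by setting $\epsilon_0 = 1$.

```python
import math

HBAR = 1.054571817e-34        # Planck's constant of action, J s
C = 299792458.0               # speed of light, m / s
E_CHARGE = 1.602176634e-19    # elementary charge, C
EPSILON_0 = 8.8541878128e-12  # vacuum permittivity, F / m (SI only)
GEV = 1.0e9 * E_CHARGE        # one GeV expressed in joules

def kg_to_gev(mass_kg):
    """Mass in GeV: multiply by c^2 and express the energy in GeV."""
    return mass_kg * C ** 2 / GEV

def metre_to_inverse_gev(length_m):
    """Length in GeV^-1: divide by hbar c / GeV."""
    return length_m * GEV / (HBAR * C)

def second_to_inverse_gev(time_s):
    """Time in GeV^-1: divide by hbar / GeV."""
    return time_s * GEV / HBAR

def fine_structure_constant():
    """alpha = e^2 / (4 pi epsilon_0 hbar c), the SI form of e^2 / (4 pi hbar c)."""
    return E_CHARGE ** 2 / (4.0 * math.pi * EPSILON_0 * HBAR * C)

print(kg_to_gev(1.0), metre_to_inverse_gev(1.0), second_to_inverse_gev(1.0))
print(1.0 / fine_structure_constant())
```

Going the other way, a result obtained in GeV$^{-1}$ is converted to metres or seconds by dividing by the same factors.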
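We construct a relativistic equation of motion which has first order time
and space derivatives, the Dirac equation:
$i\,\partial\psi/\partial t = (\vec{\alpha}\cdot\vec{p} + \beta m)\psi$. Energy and
momentum are conserved if its solutions also respect the Klein-Gordon equation
(see Focus Box 7.2), $-\partial^2\psi/\partial t^2 = (\vec{p}^{\,2} + m^2)\psi$.
Comparing coefficients we get an algebra for the operators $\vec{\alpha}$ and $\beta$:

\[ (\vec{\alpha}\cdot\vec{p} + \beta m)^2 = \vec{p}^{\,2} + m^2 \quad ; \quad \Big(\sum_i \alpha_i p_i + \beta m\Big)\Big(\sum_j \alpha_j p_j + \beta m\Big) = \sum_k p_k^2 + m^2 \]

\[ \alpha_1^2 = \alpha_2^2 = \alpha_3^2 = \beta^2 = 1 \quad ; \quad \alpha_i\alpha_j + \alpha_j\alpha_i = 0 \quad ; \quad \alpha_i\beta + \beta\alpha_i = 0 \quad (i \neq j) \]

The $\alpha_i$ and $\beta$ anti-commute; they cannot be simple scalars. Their algebra is
respected by hermitian and traceless 4 × 4 matrices with eigenvalues ±1. The
wave function $\psi$ thus also has four components, $(\psi_1, \psi_2, \psi_3, \psi_4)$;
it is called a spinor. Multiplying the Dirac equation by $\beta$ from the left, we get
a compact notation for the Dirac equation:

\[ i\beta\,\frac{\partial\psi}{\partial t} = -i\beta\,\vec{\alpha}\cdot\nabla\psi + m\psi \quad \to \quad \left(i\gamma^\mu\partial_\mu - m\right)\psi = 0 \]

We are defining a four-vector of operators, $\gamma^\mu = (\beta, \beta\vec{\alpha})$.
The algebra of its components follows from that of the $\vec{\alpha}$ and $\beta$ matrices:

\[ (\gamma^0)^2 = \beta^2 = 1 \quad ; \quad (\gamma^k)^2 = \beta\alpha_k\beta\alpha_k = -1 \quad ; \quad \gamma^\mu\gamma^\nu + \gamma^\nu\gamma^\mu = 2g^{\mu\nu} \]

The “square” of a spinor is $\psi^\dagger\psi$ with the hermitian adjoint spinor
$\psi^\dagger = (\psi_1^*, \psi_2^*, \psi_3^*, \psi_4^*)$. Using the same method as for
the Schrödinger and Klein-Gordon equation, we derive a continuity equation:

\[ \partial_\mu\left(\bar{\psi}\gamma^\mu\psi\right) = 0 \]

with the Dirac adjoint spinor $\bar{\psi} = \psi^\dagger\gamma^0$. The term in
parentheses is a four-vector current of the probability density:

\[ j^\mu = \bar{\psi}\gamma^\mu\psi \quad ; \quad \rho = \bar{\psi}\gamma^0\psi = \psi^\dagger\psi \quad ; \quad \vec{j} = \bar{\psi}\vec{\gamma}\psi \]

Its time-like component is positive definite; the problem of negative
probabilities is thus solved. The electromagnetic current density is:
$j^\mu = Q\bar{\psi}\gamma^\mu\psi$. For a free electron with energy $E$ and
momentum $\vec{p}$ it is $j^\mu = -eN(E, \vec{p})$, the one of a positron with the same
energy-momentum is $j^\mu = +eN(E, \vec{p}) = -eN(-E, -\vec{p})$. A positron thus
evolves the same way as an electron with opposite four-momentum.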


Focus Box 7.4: Dirac equation 
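
The algebra of Focus Box 7.4 can be verified with explicit matrices. The sketch below assumes one particular choice, the standard Dirac-Pauli representation built from the Pauli matrices; the box itself does not fix a representation, and the helper names are illustrative. It uses plain nested lists so that no external library is needed.

```python
def matmul(a, b):
    """Product of two square matrices given as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def matadd(a, b):
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def negate(a):
    return [[-x for x in row] for row in a]

def scaled_identity(c, n=4):
    return [[c if i == j else 0 for j in range(n)] for i in range(n)]

def block(a, b, c, d):
    """Assemble a 4 x 4 matrix from four 2 x 2 blocks [[a, b], [c, d]]."""
    return [ra + rb for ra, rb in zip(a, b)] + [rc + rd for rc, rd in zip(c, d)]

I2 = [[1, 0], [0, 1]]
Z2 = [[0, 0], [0, 0]]
SIGMA = [[[0, 1], [1, 0]], [[0, -1j], [1j, 0]], [[1, 0], [0, -1]]]  # Pauli matrices

BETA = block(I2, Z2, Z2, negate(I2))                # assumed representation of beta
ALPHA = [block(Z2, s, s, Z2) for s in SIGMA]        # assumed representation of alpha_k
GAMMA = [BETA] + [matmul(BETA, a) for a in ALPHA]   # gamma^mu = (beta, beta alpha)
METRIC = [1, -1, -1, -1]                            # diagonal of g^{mu nu}

def anticommutator(a, b):
    return matadd(matmul(a, b), matmul(b, a))

def clifford_algebra_holds():
    """Check gamma^mu gamma^nu + gamma^nu gamma^mu = 2 g^{mu nu} for all index pairs."""
    for mu in range(4):
        for nu in range(4):
            expected = scaled_identity(2 * METRIC[mu] if mu == nu else 0)
            if anticommutator(GAMMA[mu], GAMMA[nu]) != expected:
                return False
    return True

print(clifford_algebra_holds())
```

Any other set of matrices related to these by a unitary transformation satisfies the same relations; the representation only matters when explicit spinor components are needed.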
the first time. Some take this as a prediction of the existence of the positron,
discovered two years later in cosmic rays (see Section 8.1). The hole theory 
itself did not survive. 

The Genevan Ernst Stückelberg² [297] and the American Richard P. Feynman [311]
found a much more convincing explanation of negative energy states.
Their ansatz is to identify negative energies with particles evolving backwards 
in time, or equivalently antiparticles evolving forward in time. The current 
density of both is the same. Thus the emission of a positron with positive 
energy is identical to the absorption of an electron with negative energy. One 
can also say that particle solutions with negative energy can always be replaced 
by antiparticle solutions with positive energy and inverted momentum direc- 
tion, as shown in Focus Box 7.4. 

This has more dramatic consequences than it seems at first glance. Look at 
the space-time sketch of Figure 7.1. In the left diagram, the incoming electron 
scatters twice off the potential symbolised by a cloud, first at position (1), 
then at (2). In the right diagram, the time order is reversed. The intermediary 
electron now moves backward in time. We replace it by a positron moving 
forward. Thus an electron-positron pair is first created at (1), the resulting 
positron is annihilated with the incoming electron at (2), only the electron is 
left in the final state. According to Heisenberg’s uncertainty relation, if the 
energy is sufficiently fixed, the time order of (1) and (2) cannot be known, 
the two processes are indistinguishable. At the same time, they only differ in 
that an electron moving backward in time is replaced by a positron moving 
forward, which changes nothing according to Feynman and Stückelberg. Their 
interpretation of negative energy solutions thus naturally leads to particle 
creation and disappearance processes required by observation. 


Figure 7.1 Space-time diagram for the scattering of an electron off a potential 
symbolised by a cloud. On the left, the electron simply scatters twice at posi- 
tions (1) and (2). In the right diagram, the creation of an electron-positron 
pair at (1) is followed by an annihilation of the positron with the incoming 
electron at (2). Because of Heisenberg’s uncertainty, the two processes are 
indistinguishable at small scale. 


² His full name was Baron Ernst Carl Gerlach Stückelberg von Breidenbach zu
Breidenstein und Melsbach.


7.3 INTERACTING FIELDS


Knowing how quantum fields move and how potentials are created by charges 
and currents, it remains to understand how they interact with each other. 
Relativistic quantum field theory³ provides that understanding. But an additional
conceptual step is required: not only observables, but also the matter fields and
potential fields themselves become operators. This way the focus shifts from
the motion of a particle in a potential towards the interaction of quantised
fields. Matter particles are the quanta of fields. Force fields (or rather potential
fields) also have quanta; the electromagnetic potential, for example, is quantised
in photons, as seen in the photoelectric effect (see Section 5.2). We thus come
back to the particle nature of light, as already discussed by Newton in his 
“Opticks”. 

The form of quantised electrodynamics we know today was devised
by Richard P. Feynman in the 1940s. Matter and force fields are operators.
The matter field is an operator for the creation or annihilation of a particle 
at a given space-time point. With Freeman Dyson [323] we can also call them 
emission and absorption operators. I find this more to the point, since the 
space-time point we are talking about is typically an interaction vertex. An 
incoming particle is absorbed at that point, a final state particle is emitted. 
Probability amplitudes result when such operators are applied to the space 
which contains all states. While wave functions have thus changed their mean- 
ing, we can still take them as amplitudes of probability waves. We will continue 
to go back and forth freely between operators, their representations and their 
eigenvalues. In particular, the meaning of the conserved current density stays 
the same. It plays an important role in scattering theory, as we will see shortly. 

The central notion of a scattering theory based on these concepts is called 
an invariant amplitude if one likes wave mechanics or a matrix element if one 
prefers matrix terminology. It is the probability amplitude for the transition of
an initial state $\psi_i$ into a final state $\psi_f$, and it is written as
$\mathcal{M}_{fi}$. Consequently, its square $|\mathcal{M}_{fi}|^2$ is the probability
density for this process. The square of
the invariant amplitude, suitably normalised, enters into the calculation of
an observable, which measures the probability for the process in question. In
atomic physics, this observable is the intensity of a spectral line. For particle
reactions, it is the cross section $\sigma$ or the decay width $\Gamma$. The concept of
the cross section is a geometric one, sketched in Figure 7.2 for a reaction
$a + b \to c + d$. We associate a fictitious surface $\sigma$ to each target particle $b$. Whenever a
projectile a hits such a surface, the reaction takes place, otherwise it doesn’t. 
The reaction rate is thus the product of the flux of incoming particles a, i.e. 
their number per unit surface and per unit time, the number of target particles 
exposed to this beam, and the cross section. This simple relation supposes that 
the surface density of target particles is small enough, such that the fictitious 
surfaces do not overlap and their total surface presented to the incoming beam 


³ A detailed history of quantum field theory, with complete documentation, is found in
[479, p. 1-38].


is simply their sum. The cross section thus has the dimension of a surface, but
of course no such surface exists. Instead, $\sigma$ measures the probability for the
process to take place. We explain the relation between invariant amplitude 
and cross section, Fermi’s golden rule [420], in Focus Box 7.6. 

The product between incoming projectile flux and the number of target 
particles is a characteristic of each experiment, called the luminosity. It is the 
maximum rate of reactions which would be observed if each projectile
interacted with a target particle. The cross section is thus the ratio of the actual
reaction rate and its maximum:

\[ \frac{\#\text{reactions}}{\text{time}} = \underbrace{\frac{\#\text{projectiles}}{\text{time}\times\text{surface}} \times \#\text{targets}}_{\text{luminosity}} \times\ \text{cross section} \]

It is indeed a probability in strange units, measuring the strength of a reaction
independent of experimental details. For particle waves, the cross section is
discussed in Focus Box 7.5 using Huygens’ principle.


Figure 7.2 Geometrical concept of the cross section for a particle reaction
$a + b \to c + d$. Each target particle $b$ is associated with a fictitious surface $\sigma$.
The reaction takes place whenever a projectile $a$ hits that surface.


In an experiment, the cross section is extracted from the reaction rate, by 
dividing by the only experiment-dependent factor, the luminosity. It is measured
in the enormously large units of barn, $1\,\mathrm{b} = 10^{-28}\,\mathrm{m}^2$. Cross
sections one encounters for reactions among elementary particles are rather of the
order of nanobarn, $1\,\mathrm{nb} = 10^{-9}\,\mathrm{b}$, or picobarn,
$1\,\mathrm{pb} = 10^{-12}\,\mathrm{b} = 10^{-40}\,\mathrm{m}^2$. For neutrinos
they are even as small as $10^{-42}\,\mathrm{m}^2$ (at 1 GeV). Instead of just counting
the total number of reactions, one can classify them according to observables
like the scattering angle, i.e. the solid angle $\Omega$ between an incoming particle $a$
and an outgoing reaction product $c$. In that case, one measures the differential
cross section $d\sigma/d\Omega$.
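
The relation between rate, luminosity and cross section is simple enough to put into a few lines. The following is a minimal sketch with illustrative function names; the flux, the number of targets and the observed rate in the example are made-up numbers, not data from any experiment.

```python
BARN_IN_M2 = 1.0e-28
PREFIX = {"b": 1.0, "nb": 1.0e-9, "pb": 1.0e-12}

def to_square_metres(value, unit):
    """Convert a cross section given in b, nb or pb into square metres."""
    return value * PREFIX[unit] * BARN_IN_M2

def luminosity(flux_per_m2_per_s, n_targets):
    """Incoming projectile flux times the number of target particles."""
    return flux_per_m2_per_s * n_targets

def reaction_rate(lumi, sigma_m2):
    """Reactions per second: luminosity times cross section."""
    return lumi * sigma_m2

def cross_section(rate_per_s, lumi):
    """What an experiment does: divide the observed rate by the luminosity."""
    return rate_per_s / lumi

# Hypothetical experiment: flux of 1e14 projectiles per m^2 and second on 1e24 targets.
lumi = luminosity(1.0e14, 1.0e24)
rate = reaction_rate(lumi, to_square_metres(1.0, "pb"))
print(lumi, rate, cross_section(rate, lumi))
```

With these invented numbers a process of one picobarn would occur about once per hundred seconds, which illustrates why small cross sections require large luminosities.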

In case of a decay $a \to c + d + \ldots$, i.e. of a single particle in the initial state,
the corresponding notion quantifying the strength of the interaction is the
decay width $\Gamma$, as explained in Focus Box 7.7. Its inverse is the lifetime $\tau_a$ of the
mother particle $a$. The width has the dimension of an energy and corresponds
to the width of the mass distribution of the final state. The momenta and 
energies of the final state particles add up as four-vectors. The square of the 
Let us formulate Huygens’ principle for an incoming plane wave with
frequency $\omega$ and wave vector $\vec{k}$,
$\psi_i(\vec{x}, t) = \sqrt{N}\,e^{-i(\omega t - \vec{k}\cdot\vec{x})}$, impacting on a
point-like scattering centre. What happens is sketched below.

[Sketch: incoming plane wave, transmitted plane wave and scattered spherical wave]

A slightly diminished plane wave goes through undisturbed, and a spherical wave
with small amplitude originates from the scattering centre. Let us normalise the
incoming plane wave simply with $N = 1$, such that
$\psi_i = e^{-i(\omega t - \vec{k}\cdot\vec{x})}$ is a wave with probability density
$\rho = |\psi_i|^2 = 1$. Its intensity $I$ is proportional to this density
and the velocity $v = \omega/k$, $I = \rho v = \omega/k$.

According to Huygens’ principle, the outgoing wave is a superposition of the
transmitted wave with amplitude $(1 - f)$ and the scattered wave with amplitude $f$:

\[ \psi_f = \left(1 - f(\theta, \phi)\right) e^{-i(\omega t - \vec{k}\cdot\vec{x})} + f(\theta, \phi)\,\frac{e^{-i(\omega t - kr)}}{r} \]

The first term describes the transmitted wave. The factor $1/r$ in the term
for the scattered wave, inversely proportional to the distance $r$ from the
scattering centre, is necessary to preserve the normalisation. At any point
and time, the total intensity will be the square of this wave function and
thus have three terms: the square of the transmitted wave, the square of
the scattered wave and an interference term between the two. Ignoring
interference for the moment, the intensity scattered into a volume element
characterised by the solid angle element $d\Omega$ and the radial thickness $dr$ is
$\left|f(\theta, \phi)\,e^{-i(\omega t - kr)}/r\right|^2 (r^2 d\Omega)\,dr = |f(\theta, \phi)|^2\,d\Omega\,dr$.
The flux scattered into $d\Omega$ is then
$|f(\theta, \phi)|^2\,d\Omega\,dr/dt = |f(\theta, \phi)|^2\,v\,d\Omega$. The ratio of this
flux to the incoming one is the probability of scattering, the cross section:

\[ \frac{d\sigma}{d\Omega} = |f(\theta, \phi)|^2 \]

We can generalise this finding for an arbitrary initial state. The free state
before the scattering can be written as a superposition of eigenstates $\psi_n$ of
the free Hamiltonian (energy) operator:
$\psi_i(\vec{x}, t) = \sum_n a_n(t)\,\psi_n(\vec{x})\,e^{-iE_n t}$. The
final state $\psi_f$ can also be written in this way, but with other coefficients.
Now take the action of the potential as a small perturbation at time $t = 0$
and assume a pure initial state with a single coefficient $a_i = 1$ and all
others zero. According to Huygens’ principle, the incoming amplitude will
be a little reduced, while a small additional amplitude $a_f$ is excited by the
scatter. When the potential is known, it can be calculated. Indeed, as shown
above, $a_f$, correctly normalised, is nothing else but the invariant amplitude
$\mathcal{M}_{fi}$, the probability amplitude for this process.


Focus Box 7.5: Huygens’ principle for quanta 
Consider a reaction, where a fixed electromagnetic potential $A^\mu$, like that of
a heavy nucleus, converts a scalar initial state $\psi_a$ into a final state $\psi_c$.

[Sketch: the incoming state $\psi_a$ interacts once with the potential $A^\mu$ and leaves as $\psi_c$]

To first order, we can consider that a single action of the local potential
$A^\mu$ transforms $\psi_a$ into $\psi_c$:

\[ \mathcal{M} = -i\int d^4x\; j^{ca}_\mu A^\mu \]

The electromagnetic current density,
$j_{ca}^\mu = -ie\left(\psi_c^*\,\partial^\mu\psi_a - \psi_a\,\partial^\mu\psi_c^*\right)$, which
couples to the electromagnetic potential, $A^\mu$, is the same as the one we had
found for free scalar particles (see Focus Box 7.2), except that the incoming
and outgoing wave functions are no longer the same. The argument of the
integral is the probability density to find $\psi_c$ among all the possible final states
produced by $A^\mu$ at the interaction vertex, $x = (t, \vec{x})$. The integration over
space-time takes into account that the interaction can take place wherever
$A^\mu$ reigns, i.e. anywhere for a static potential. The cross section can then be
calculated by properly normalising the probability of the reaction, represented
by $|\mathcal{M}|^2$. One obtains Fermi’s Golden Rule:

\[ d\sigma = \frac{|\mathcal{M}|^2}{F}\,dQ \]

The incident flux $F$ depends on the relative velocity $|\vec{v}_a - \vec{v}_b|$ of
projectile and target:

\[ F = |\vec{v}_a - \vec{v}_b|\;2E_a\,2E_b = 4\sqrt{(p_a p_b)^2 - m_a^2 m_b^2} \]

The second equation is valid for $\vec{v}_a \parallel \vec{v}_b$ and shows that this
quantity is an invariant under Lorentz transformations. So is the phase space
factor $dQ$, which takes into account the number of possible final state configurations:

\[ dQ = (2\pi)^4\,\delta^{(4)}(p_c + p_d - p_a - p_b)\,\frac{d^3p_c}{(2\pi)^3\,2E_c}\,\frac{d^3p_d}{(2\pi)^3\,2E_d} \]

It contains a $\delta$-term ensuring energy-momentum conservation as well as the
number of quantum states for all reaction products. The factors $N_i = 2E_i$
come in because of our normalisation of the wave function (see Focus Box 7.2).


Focus Box 7.6: Fermi’s golden rule 
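
That the two expressions for the incident flux agree for collinear motion can be checked numerically. The sketch below is a minimal illustration in natural units, with both particles moving along the z axis; the function names and the masses and momenta used are illustrative assumptions.

```python
import math

def four_momentum(mass, pz):
    """On-shell four-momentum (E, px, py, pz) for motion along the z axis."""
    return (math.sqrt(mass * mass + pz * pz), 0.0, 0.0, pz)

def four_dot(p, q):
    """Minkowski product with metric (+, -, -, -)."""
    return p[0] * q[0] - p[1] * q[1] - p[2] * q[2] - p[3] * q[3]

def flux_velocity_form(pa, pb):
    """F = |v_a - v_b| 2E_a 2E_b, with v = p_z / E for collinear motion."""
    va = pa[3] / pa[0]
    vb = pb[3] / pb[0]
    return abs(va - vb) * 2.0 * pa[0] * 2.0 * pb[0]

def flux_invariant_form(pa, pb, ma, mb):
    """F = 4 sqrt((p_a . p_b)^2 - m_a^2 m_b^2)."""
    return 4.0 * math.sqrt(four_dot(pa, pb) ** 2 - (ma * mb) ** 2)

# Head-on collision with illustrative masses and momenta (GeV).
ma, mb = 0.14, 0.938
pa, pb = four_momentum(ma, 10.0), four_momentum(mb, -5.0)
print(flux_velocity_form(pa, pb), flux_invariant_form(pa, pb, ma, mb))

# Same projectile on a target at rest: the invariant form reduces to 4 m_b |p_a|.
pb_rest = four_momentum(mb, 0.0)
print(flux_invariant_form(pa, pb_rest, ma, mb), 4.0 * mb * 10.0)
```

Since the second form only contains four-vector products and masses, it takes the same value in every frame reached by a boost along the collision axis.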
total mass of all decay products is the square of their total energy minus the
square of their total momentum, $m^2 = (\sum_i E_i)^2 - (\sum_i \vec{p}_i)^2$. The
invariant $m$ corresponds to the mass of the mother particle $a$. For a finite
lifetime, this is not a fixed number, but distributed around the mean value, $m_a$.
The width of the mass distribution $\Delta m_a$ is inversely proportional to its
lifetime, due to Heisenberg’s uncertainty, $\Delta m_a \tau_a \geq \hbar$. The shorter
the lifetime of a particle,
the wider the mass distribution around its mean value.
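
The invariant mass and the relation between lifetime and width can be illustrated as follows. This is a minimal sketch in natural units; the function names are my own, the decay into two daughters of equal mass is a simplifying assumption, and the masses in the example are invented round numbers. The conversion constant is the standard value of $\hbar$ in GeV s.

```python
import math

HBAR_GEV_S = 6.582119569e-25  # Planck's constant of action in GeV s

def invariant_mass(particles):
    """m = sqrt((sum E)^2 - (sum p)^2) for a list of four-momenta (E, px, py, pz)."""
    e = sum(p[0] for p in particles)
    px = sum(p[1] for p in particles)
    py = sum(p[2] for p in particles)
    pz = sum(p[3] for p in particles)
    return math.sqrt(max(e * e - (px * px + py * py + pz * pz), 0.0))

def two_body_rest_frame(mother_mass, daughter_mass):
    """Back-to-back daughters of equal mass, emitted along z in the mother's rest frame."""
    e = mother_mass / 2.0
    p = math.sqrt(e * e - daughter_mass * daughter_mass)
    return [(e, 0.0, 0.0, p), (e, 0.0, 0.0, -p)]

def width_from_lifetime(tau_seconds):
    """Width in GeV for a lifetime in seconds, taking width times lifetime equal to hbar."""
    return HBAR_GEV_S / tau_seconds

daughters = two_body_rest_frame(1.0, 0.3)   # invented masses in GeV
print(invariant_mass(daughters))            # reconstructs the mother mass
print(width_from_lifetime(1.0e-23))         # a very short lifetime gives a sizeable width
```

In a real measurement the four-momenta of the daughters carry experimental errors, so the reconstructed mass distribution is wider than the natural width alone.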


For a generic decay of the mother particle $a$ into at least two daughter
particles, $a \to c + d + \ldots$, the connection between decay width $\Gamma_a$ and
invariant amplitude $\mathcal{M}$ is:

\[ d\Gamma_a = \frac{1}{2E_a}\,|\mathcal{M}|^2\,\frac{1}{(2\pi)^2}\,\delta^{(4)}(p_a - p_c - p_d)\,\frac{d^3p_c}{2E_c}\,\frac{d^3p_d}{2E_d} \]

For a two-body decay in the rest system of the mother particle, we have:

\[ \Gamma(a \to c + d) = \frac{|\vec{p}|}{32\pi^2 m_a^2}\int |\mathcal{M}|^2\,d\Omega \]

with the momentum $|\vec{p}| = |\vec{p}_c| = |\vec{p}_d|$ in the final state. The
integration goes over the solid angle element $d\Omega = \sin\theta\,d\theta\,d\phi$
with the polar angle $\theta$ and the azimuth $\phi$. The decay width is measured via
the decay rate, the fraction of mother particles,
$\Delta N_a/N_a = -\Gamma\Delta t$, which decays in a lapse of time $\Delta t$. Thus
a constant fraction decays per unit of time; the remaining number of mother
particles diminishes exponentially, with $N_a(t) = N_a(0)\,e^{-\Gamma t}$. The invariant
mass of the decay products, $m = \sqrt{(E_c + E_d)^2 - (\vec{p}_c + \vec{p}_d)^2}$, i.e.
the mass of the mother particle, follows a distribution around the mean value $m_a$ with
width $\Gamma_a$.


Focus Box 7.7: Decay width 
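
The step from a constant decaying fraction to the exponential law can be followed numerically. The sketch below is a minimal illustration in natural units; the function names, the number of steps and the sample values are arbitrary choices. It applies $\Delta N = -\Gamma N \Delta t$ repeatedly and compares the result with $N(0)e^{-\Gamma t}$.

```python
import math

def surviving(n0, gamma, t):
    """Closed form N(t) = N(0) exp(-Gamma t)."""
    return n0 * math.exp(-gamma * t)

def surviving_stepwise(n0, gamma, t, steps=100000):
    """Apply Delta N = -Gamma N Delta t in many small time steps."""
    dt = t / steps
    n = n0
    for _ in range(steps):
        n += -gamma * n * dt
    return n

def lifetime(gamma):
    """Mean lifetime as the inverse of the width (natural units)."""
    return 1.0 / gamma

n0, gamma = 1.0e6, 1.0
print(surviving(n0, gamma, 2.0), surviving_stepwise(n0, gamma, 2.0))
print(surviving(n0, gamma, lifetime(gamma)) / n0)  # fraction left after one lifetime
```

The stepwise result approaches the exponential as the time step shrinks, and after one lifetime the population has dropped to the fraction $1/e$.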


What we have quoted so far is the first term of a perturbative expansion, 
since we assumed that the initial and final state are free particles, which 
interact with the potential at a single space-time point. After its pioneer Max 
Born [217], this first order term is also called the Born approximation. For 
electromagnetic interactions, we calculate it in Focus Box 7.8. The progress 
of the perturbation series is counted by the order of the coupling constant 
between the current and the potential, e for electromagnetic forces. Further 
accuracy of the calculation is achieved when higher order terms are included, 
with a second interaction, a third one and so on. Provided, of course, that 
higher order terms are successively smaller and smaller perturbations, such 
that the series converges. The conditions under which this is indeed the case 
deserve a more in-depth discussion which we defer to Chapter 9. 

We now have to go one step further and describe how the electromag- 
netic potential is generated and how it reaches the space-time point where it 
interacts with the current density. The generation of fields and potentials is 
described by Maxwell’s equations. Or rather, Maxwell’s equations describe the
change of fields and potentials. But that is all we need. Once a current density
has caused an electromagnetic wave, it will propagate according to wave
mechanics. We can thus describe the interaction as one between two currents, 
one that emits the photon, the other which absorbs it. And again this is an 
interaction, since the roles can be interchanged. Figure 7.3 represents such an 
electromagnetic scattering by exchange of a photon. Diagrams of this type are 
called Feynman diagrams after their inventor [457]. They represent the pro- 
cess visually, but they are also precise prescriptions for the calculation of the 
corresponding invariant amplitude. One can interpret them as space-time dia- 
grams of what happens during the interaction, but of course within the limits 
of Heisenberg’s uncertainty. As you can see in Focus Box 7.8, the conversion of 
interacting current densities into a probability amplitude involves an integral 
over space-time. It is really an integral over all possible paths which the par- 
ticles can take from the initial to the final state of a reaction. Thus Feynman 
called his method a space-time approach [311]. After the integration, all that 
is left are energies and momenta of the particles involved. We no longer care, 
nor can we possibly know where or when the interaction happened. For us, 
the straight lines with arrows in a Feynman diagram indicate matter particle 
momenta, wiggly lines indicate the momenta of force quanta, photons so far. 


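[Feynman diagram: the projectile line carries $\psi_a$, $j_{ca}$ and $\psi_c$; the target line carries $\psi_b$, $j_{db}$ and $\psi_d$; a wiggly photon line connects the two vertices]


Figure 7.3 Feynman diagram of an electromagnetic scattering $a + b \to c + d$
in Born approximation.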


From the annotated Feynman diagram one sees that each element of the 
graph corresponds to an operator, or a probability amplitude; all enter
multiplicatively into the invariant scattering amplitude. The straight lines
correspond to matter fields and symbolise their field operators. The current
densities $\psi_{a,b}$ annihilate the incoming fields at the interaction vertices,
$\psi_{c,d}$ create the outgoing fields. The wiggly line represents the exchanged
photon, $A_\mu$, with
and not with the electromagnetic field. We will discuss this further in Chap- 
ter 9. The scattering amplitude can be interpreted as the probability amplitude 
for the complete process. Its square is then the probability density, integrated 
over space-time it becomes the probability and determines the cross section. 

The correspondence between the elements of a Feynman diagram and the
factors entering into the invariant amplitude for a process is called the
Feynman rules. How they can be derived for a given field theory is fixed by simple
We calculate the invariant amplitude for the process in Figure 7.3, using the
result obtained in Focus Box 7.6. We have to evaluate the four-potential $A^\mu$
as generated by the current density of the target, $j^\mu_{db}$. The Maxwell
equations being covariant under Lorentz transformations, we can also use them
in the relativistic domain. They are also invariant under gauge transformations
of the form $A_\mu \to A_\mu + \partial_\mu \chi$ with an arbitrary scalar
function $\chi(x)$, because these transformations do not change the field tensor.
We can thus change the gauge for the four-vector of the electromagnetic potential
such that $\partial^\mu A_\mu = 0$. This is called the Lorenz gauge, named after
the Danish physicist and mathematician Ludvig Valentin Lorenz (not to be confused
with the Dutch Hendrik Antoon Lorentz). In this gauge, Maxwell’s equations take
the form of a wave equation:


$$\Box A^\mu \equiv \partial_\nu \partial^\nu A^\mu = j^\mu$$


A transition current density $j^\mu_{db}$ for fields which describe free
particles before and after the interaction produces the electromagnetic potential:

$$A^\mu = -\frac{1}{q^2}\, j^\mu_{db}$$

The four-momentum transfer, $q = p_c - p_a = p_b - p_d$, is the difference between
outgoing and incoming four-momentum of the projectile, which is transferred
to the target. The invariant amplitude for this scattering process is then:

$$\mathcal{M} = -i\, j_\mu^{ca}\, \frac{1}{q^2}\, j^\mu_{db}$$

Diagrammatically the amplitude is represented by the Feynman diagram 7.3.
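
The step from the wave equation to the potential can be made explicit. The
following is only a sketch: it assumes that the free-particle fields are plane
waves $\propto e^{-ip\cdot x}$, and the constant amplitude $N^\mu$ is a name
introduced here purely for illustration.

```latex
% Transition current of the target for plane-wave fields
% (N^\mu is a constant amplitude, hypothetical name)
j^\mu_{db}(x) = N^\mu\, e^{i(p_d - p_b)\cdot x} = N^\mu\, e^{-i q\cdot x},
\qquad q = p_b - p_d

% The wave operator acting on a plane wave yields a factor -q^2
\partial_\nu \partial^\nu\, e^{-i q\cdot x}
  = (-i q_\nu)(-i q^\nu)\, e^{-i q\cdot x}
  = -q^2\, e^{-i q\cdot x}

% An ansatz A^\mu \propto e^{-i q\cdot x} thus solves the wave equation if
-q^2 A^\mu = j^\mu_{db}
\quad\Longrightarrow\quad
A^\mu = -\frac{1}{q^2}\, j^\mu_{db}
```

The factor $1/q^2$ that appears here is what the main text calls the photon
propagator.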


For fermions, the scattering amplitude in terms of spinor currents has some 
more complex math, but the underlying idea is exactly the same. 


Focus Box 7.8: Electromagnetic scattering in Born approximation 
rules, almost like in a cookbook. One can indeed say: field theory = Feynman
rules [486]. Here they define quantum electrodynamics, QED [457]. The
electric charge $e$ in the current density, which enters at each vertex,
determines the order of magnitude for the probability amplitude. The total
scattering amplitude is thus proportional to $e^2$, the cross section is
proportional to $\alpha^2 = e^4/(4\pi)^2 \approx (1/137)^2$. The photon propagator
$1/q^2$ expresses the probability amplitude that the two current densities
exchange a photon of four-momentum $q$.

At each vertex, the four-momentum is strictly conserved. That means, on 
the other hand, that the exchanged photon has to have strange properties. 
The square of its four-momentum, i.e. the square of its mass, is not zero, as 
it would be for a free photon. This deviation from its rest mass is a property 
of all particles, which connect two vertices in a Feynman diagram, i.e. which 
transmit forces. They are therefore called virtual particles. Their propagator 
has a pole at the “true” mass of the free particle. For the photon, it lies at
$q^2 = 0$. That the amplitude becomes very large for this value of the four-momentum
transfer is irrelevant for QED. It corresponds to the limiting case
of an interaction without momentum transfer, an interaction which does not 
really take place. That the probability for no interaction is much larger than 
that for a noticeable one is in agreement with our perturbative approach to 
interactions in general. 
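
That the exchanged photon is off its mass shell can be checked numerically. The
sketch below is only an illustration under stated assumptions: elastic
scattering in the centre-of-mass frame, natural units, and a projectile momentum
of 1 GeV with the electron mass, all chosen for convenience. The function names
are hypothetical.

```python
import math


def minkowski_square(p):
    """Invariant square E^2 - |p|^2 of a four-vector (E, px, py, pz)."""
    e, px, py, pz = p
    return e * e - (px * px + py * py + pz * pz)


def elastic_momentum_transfer(p_abs, mass, theta):
    """Four-momentum transfer q = p_c - p_a for elastic scattering by the
    angle theta in the centre-of-mass frame (assumed kinematics)."""
    energy = math.sqrt(p_abs**2 + mass**2)
    p_a = (energy, 0.0, 0.0, p_abs)  # incoming projectile along z
    p_c = (energy, p_abs * math.sin(theta), 0.0, p_abs * math.cos(theta))
    return tuple(c - a for c, a in zip(p_c, p_a))


if __name__ == "__main__":
    for deg in (0, 10, 90, 180):
        q = elastic_momentum_transfer(1.0, 0.000511, math.radians(deg))
        print(f"theta = {deg:3d} deg   q^2 = {minkowski_square(q):+.4f} GeV^2")
```

For any non-zero scattering angle the square of the four-momentum transfer comes
out negative, so it differs from the value $q^2 = 0$ of a free photon; it only
vanishes in the limit of no deflection.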

We notice again one of the conceptually largest differences between the 
quantum and the classical approach to interactions. In classical physics an 
interaction, however small, always takes place, when a non-zero potential or 
force is present. In quantum processes, nature has a choice to let the interac- 
tion happen or not. And indeed no interaction is the default. 

We also notice that matter fields interact with the potential, not with
forces. From your course of classical physics you may have taken away the
impression that the liberty to gauge potentials gives them a somewhat fuzzy
meaning. After all one can add a constant to the electrical or gravitational
potential and the predicted trajectory of particles remains unchanged. Likewise
one can add a curl-free vector field, i.e. the gradient of a scalar function, to
the magnetic vector potential and nothing changes in the classical case. In
quantum systems, this is no longer true: particles interact with the potential
even in the absence of a force field. The first to point out this astonishing
effect were Werner Ehrenberg and Raymond E. Siday [313]. An experimental test
was proposed by Yakir Aharonov and David Bohm [342, 349]; it was first
implemented by Robert G. Chambers [346], and shortly afterwards by Gottfried
Möllenstedt and W. Bayh [364].
The mechanism is that the electromagnetic potential changes the phase of
the matter field. This phase by itself is unobservable since the probability is
given by the square of the field, where the phase drops out. But interference
experiments allow phase changes to be observed. Focus Box 7.9 shows you how.

That all scattering processes are wavelike in nature, and thus described by 
Huygens’ principle, is indeed proven when interference occurs. An important 
interference phenomenon occurs when more than one process can mediate a 
reaction between identical initial and final states. That is for example the 
The electromagnetic potential changes the phase of charged matter fields. For
simplicity we will consider a non-relativistic matter particle, where the same
happens. Let the particle move in an electric potential $V(t)$, which depends
on time:

$$i\frac{\partial \psi}{\partial t} = \left(H_0 + V(t)\right)\psi$$

The operator $H_0$ describes the free motion, $V$ is treated as a perturbation.
The solutions will have a time dependent phase, $\psi = \psi_0 e^{-iS}$:

$$i\frac{\partial \psi}{\partial t} = i\frac{\partial \psi_0}{\partial t}\, e^{-iS} + \psi_0\, e^{-iS}\, \frac{\partial S}{\partial t} = H_0 \psi_0\, e^{-iS} + V \psi_0\, e^{-iS}$$

A comparison of coefficients shows that $\partial S/\partial t = V(t)$. Thus on a
closed path, the field will undergo a phase shift of $\Delta S = \oint V(t)\,dt$.
The relativistic answer is completely analogous to this, but for a
four-potential $A_\mu$:

$$\Delta S = \oint V\,dt - \oint \vec{A}\cdot d\vec{x} = \oint A_\mu\, dx^\mu$$


In the absence of an electrical potential, the phase shift is thus proportional
to the magnetic flux included in the path. It is observable if one lets the
phase-shifted field interfere with the original field. This is the idea of what
is called an Aharonov-Bohm experiment, a double slit experiment as shown on the
right.

[Figure: sketch of the Aharonov-Bohm double slit set-up]


Electrons from the source s are brought to interference through a double slit
b. In the shadow of the round middle screen, a long thin solenoid a is placed.
Outside the solenoid, the magnetic field is zero, but the magnetic potential
decreases inversely with the distance from the solenoid. Thus it cannot be zero
everywhere, regardless of its gauge. The interference pattern is observed on
the screen as the intensity as a function of the displacement o. The intensity of
the interference stripes is modulated by the image of the single slit, as shown
in Focus Box 5.10. Thus if there were a magnetic force on the electron, e.g. by
a stray field a’ of the solenoid, the envelope and the interference stripes would
shift alike. If there is no field, only the stripes shift under a constant envelope.
This is exactly what is observed. Chambers in his pioneering implementation
of the experiment used a ferromagnetic whisker to implement the changing
potential. Möllenstedt and Bayh used a very fine solenoid electromagnet. In
a modern implementation, Akira Tonomura [460] and collaborators used a
superconductor to confirm the result.
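
To get a feeling for the size of the effect, the phase shift can be evaluated in
SI units, where it reads $\Delta\varphi = e\Phi/\hbar$ for an enclosed magnetic
flux $\Phi$. The sketch below assumes an ideal solenoid with a uniform interior
field; the field strength and radius are illustrative values, not those of the
experiments cited above, and the function names are hypothetical.

```python
import math

H_PLANCK = 6.62607015e-34    # Planck constant in J s (exact SI value)
E_CHARGE = 1.602176634e-19   # elementary charge in C (exact SI value)
HBAR = H_PLANCK / (2.0 * math.pi)


def solenoid_flux(b_field, radius):
    """Flux (Wb) of an ideal long solenoid: uniform field b_field (T) inside
    the radius (m), zero field outside."""
    return b_field * math.pi * radius**2


def aharonov_bohm_phase(flux):
    """Relative phase (rad) between two electron paths enclosing the flux."""
    return E_CHARGE * flux / HBAR


def fringe_shift(flux):
    """Shift of the interference stripes in units of the stripe spacing."""
    return aharonov_bohm_phase(flux) / (2.0 * math.pi)


if __name__ == "__main__":
    # Assumed illustrative values: 1 mT inside a solenoid of 1 micrometre radius
    flux = solenoid_flux(b_field=1.0e-3, radius=1.0e-6)
    print(f"flux  = {flux:.3e} Wb")
    print(f"phase = {aharonov_bohm_phase(flux):.2f} rad")
    print(f"shift = {fringe_shift(flux):.3f} stripe spacings")
```

An enclosed flux of $h/e$ shifts the pattern by exactly one stripe spacing, which
is why very thin solenoids or whiskers are needed to resolve the shift.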


Focus Box 7.9: The Aharonov-Bohm experiment 
case, when two different virtual intermediate particles can transform the
initial into the final state, such that no observable allows the two
contributions to be distinguished, not even in principle. The ability to
distinguish works regardless of whether the observable is actually measured or
not, or even can be measured with existing devices. It suffices that it exists
in principle. Indistinguishable processes can interfere with each other at the
amplitude level. If we have two amplitudes, $\mathcal{M}_1$ and $\mathcal{M}_2$,
describing the two contributions, the square of the total amplitude will be
$|\mathcal{M}|^2 = |\mathcal{M}_1 + \mathcal{M}_2|^2 = |\mathcal{M}_1|^2 + |\mathcal{M}_2|^2 + 2\,\mathrm{Re}(\mathcal{M}_1 \mathcal{M}_2^*)$;
an interference term appears. It can increase or decrease the total probability
of the process, depending on whether the interference is constructive or
destructive. A prominent example is the interference between the exchange of
photons and Z bosons, electroweak interference, which we will discuss in Chapter 9.
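
The role of the relative phase can be illustrated with two complex numbers
standing in for the amplitudes. This is a toy sketch: the magnitudes 1 and 0.5
are arbitrary assumptions and the function names are hypothetical.

```python
import cmath
import math


def total_probability(m1, m2):
    """|M1 + M2|^2 for two indistinguishable contributions."""
    return abs(m1 + m2) ** 2


def interference_term(m1, m2):
    """Cross term 2 Re(M1 M2*): positive if constructive, negative if destructive."""
    return 2.0 * (m1 * m2.conjugate()).real


if __name__ == "__main__":
    m1 = complex(1.0, 0.0)
    for phase_deg in (0, 90, 180):
        # Second amplitude with assumed magnitude 0.5 and a variable phase
        m2 = 0.5 * cmath.exp(1j * math.radians(phase_deg))
        incoherent = abs(m1) ** 2 + abs(m2) ** 2
        print(f"phase {phase_deg:3d} deg: |M|^2 = {total_probability(m1, m2):.3f}, "
              f"sum of squares = {incoherent:.3f}, "
              f"interference = {interference_term(m1, m2):+.3f}")
```

If the two processes were distinguishable, only the sum of squares would be
observed, independent of the phase.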

In summary we note that in the framework of a field theory, detailed prop- 
erties of particle reactions can be calculated with the tools of perturbation 
theory. Using a well defined prescription, these properties can be measured,
such that an experimental check of the theory is feasible, as Feynman requires
in the initial quote of this chapter. In particular, it can be shown by example
that the theory is not valid or limited in its application range. The parameters
of the theory, like masses and coupling constants, can be measured experimentally.
Results of different experiments become comparable. Detailed properties
are not only absolute or relative cross sections, measuring the strength of a 
reaction, but also their dependence on kinematic quantities like energy or scat- 
tering angle, spin orientation etc. As far as decays are concerned, not only the 
total decay rate or lifetime, but also their branching into different final states 
is calculable. If the structure of the interaction is known, scattering and decay 
experiments can thus be used to investigate the structure of matter. Ruther- 
ford’s experiment to establish the existence of the atomic nucleus is a classic 
example. We will see in Chapter 9 how an analogous method established the 
existence of quarks. 

The elements of a quantum field theory are operators, which describe the 
propagation of quantised fields through spacetime, and operators for their 
pointlike interactions. Feynman diagrams allow particle reactions to be described.
Not only do they illustrate what happens, each element also has an associated 
mathematical expression, which can be interpreted as a probability amplitude. 
Together they form the set of Feynman rules which define a quantum field 
theory. Their multiplication and integration over space and time give the 
probability amplitude for a reaction, its square then is the probability. 

Quantum electrodynamics is an early prototype of such a theory and 
extremely successful. And it is successful despite the fact that in calculat- 
ing processes beyond the Born approximation, i.e. with more than one photon 
exchanged, divergent terms appear. A sophisticated method, called renormalisation,
allows these infinities to be absorbed into measured quantities, which are
by definition finite. We will come back to this method and its foundations in 
Chapter 9. Among other consequences, renormalisation leads to the astonish- 
ing fact that coupling constants like the electric charge are not really constant, 
but depend on distance. Applying renormalisation, perturbative quantum 
electrodynamics makes impressively precise predictions, which impressively
precise experiments can confront. To give but one example, the magnetic 
moments of electron and muon have been calculated and measured to a relative
accuracy of a few $10^{-13}$ and $10^{-10}$, respectively.

One would thus think that the success of quantum electrodynamics ought 
to have led to an immediate effort to apply field theoretical methods to other 
forces. Foremost to weak interactions, which manifest themselves at low ener- 
gies mainly by radioactive decay, but also to strong interactions including 
nuclear forces. This way, the triumph of the Standard Model of particle physics 
would have started in the 1950s. Instead, quantum field theory fell into deep
disrepute until well into the 1970s. The reason was not only that the treatment
of divergences by renormalisation was viewed by many as a dirty trick,
designed to sweep basic inconsistencies of the method under the rug. There
is also a conceptual problem, which has to do with the behaviour of the
theory at very small distances. It is called the Landau pole problem [324,
325, 326, 386] after the Russian physicist Lev Landau, a very influential figure
during that time and leader of a Russian school of particle physicists.
The existence of a Landau pole is a common feature of field theories which
are infrared free, i.e. which have a vanishing scattering amplitude at
large distances. Conversely, it leads to a divergence of the coupling constant,
the electrical charge for quantum electrodynamics, at a large but finite energy
scale (or at a small but finite distance), independently of its value at “normal”
energies. This pole has no practical importance for quantum electrodynamics,
because it occurs at unbelievably small distances, even smaller than the so-called
Planck scale, which itself is already some $10^{-20}$ times smaller than the
proton radius. Nevertheless the existence of this pole was taken as a sign that
there was something fundamentally inconsistent and incomplete (thus wrong,
according to the introductory quote of our chapter) with quantum field theory.
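
How far away this pole lies can be estimated with the textbook one-loop formula
for the running coupling, $\alpha(Q^2) = \alpha / \left(1 - \frac{\alpha}{3\pi}\ln(Q^2/m^2)\right)$.
The sketch below is a rough estimate only: it assumes a single charged fermion
(the electron) in the loop, and the function names are hypothetical.

```python
import math

ALPHA_0 = 1.0 / 137.035999     # fine-structure constant at low energy
M_ELECTRON_GEV = 0.000510999   # electron mass in GeV
M_PLANCK_GEV = 1.22e19         # Planck mass in GeV


def running_alpha(q_gev, alpha_0=ALPHA_0, m_gev=M_ELECTRON_GEV):
    """One-loop running coupling, assuming only one charged fermion of mass m_gev."""
    log_term = math.log(q_gev**2 / m_gev**2)
    return alpha_0 / (1.0 - alpha_0 / (3.0 * math.pi) * log_term)


def log10_landau_pole_gev(alpha_0=ALPHA_0, m_gev=M_ELECTRON_GEV):
    """log10 of the energy (in GeV) at which the one-loop denominator vanishes."""
    # 1 - alpha/(3 pi) * ln(Q^2/m^2) = 0  =>  ln(Q/m) = 3 pi / (2 alpha)
    return math.log10(m_gev) + 3.0 * math.pi / (2.0 * alpha_0) / math.log(10.0)


if __name__ == "__main__":
    print(f"1/alpha at 100 GeV (electron loop only): {1.0 / running_alpha(100.0):.1f}")
    print(f"log10(Landau pole / GeV) = {log10_landau_pole_gev():.0f}")
    print(f"log10(Planck mass / GeV) = {math.log10(M_PLANCK_GEV):.0f}")
```

In this approximation the pole lies hundreds of orders of magnitude above the
Planck mass. Including all charged particles changes the numbers, but not the
conclusion that the pole is of no practical importance.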

Philosophically speaking it can also be argued that quantum mechanical
states are not by themselves observable, but only transitions between them
are, as Heisenberg kept pointing out. It is thus intellectually attractive to
demand a theory, which predicts the amplitudes M directly without refer- 
ence to an underlying dynamics, solely on the basis of symmetry and other 
abstract requirements. Such arguments led to the renaissance of the matrix 
method, based on the S-matrix of John A. Wheeler [278]. Leading theorists 
like Steven Weinberg shared these doubts [383]: “It is not yet clear whether 
field theory will continue to play a role in particle physics, or whether it will 
ultimately be supplanted by a pure S-matrix theory.” The hope was that very 
general requirements on the S-matrix, like unitarity and consistency, together 
with symmetry criteria, would completely define it. A perturbative approach 
would then also be unnecessary, the S-matrix would depend analytically on 
kinematic quantities. These hopes led to a strange intellectual atmosphere. 
Frank Wilczek, one of the fathers of Quantum Chromodynamics (see Sec- 
tion 9.2) later recalled [491] that “strict quantum field theory was considered 
to be naive, in rather poor taste, an occasion for apology.” However, no unique 
S-matrix exists, rather there are many theories which are non-perturbatively
consistent, each with its own S-matrix. Thus at the beginning of the 1970s, it
was realised that the S-matrix approach was less a theory than a tautology.
Francis Low said in the discussion following his summary talk at the International
Conference on High-Energy Physics in Berkeley (1966) [391]: “I believe
that when you find that the particles that are there in S-matrix theory, with 
crossing matrices and all the formalism, satisfy all these conditions, all you’re 
doing, I think, is again showing that the S-matrix is consistent with the world 
the way it is; that is, the particles have put themselves there in such a way 
that it works out, but you have not necessarily explained that they are there.” 

The second principle taken for granted in such theories was that elemen- 
tary hadrons, particles like proton, neutron and pion subject to strong inter- 
actions, have no constituents. All these hundreds of particles were supposed 
to be equally fundamental, each was to be a dynamical bound state of all the 
others. Following one of its main advocates, Geoffrey Chew, this axiom was 
called “nuclear democracy” [389]. While some like Werner Heisenberg adhered 
to this ideology until the mid 1970s, it has been refuted convincingly by the 
discovery of quarks and the description of their dynamics by quantum chro- 
modynamics, the quantum field theory of strong interactions. We come back 
to these developments in Chapter 9. 

S-matrix theory nevertheless scored important points. It helped to understand
e.g. that there are relations between current densities which enter into
matrix elements (or rather into invariant amplitudes, in my preferred way of
thinking). The usage of such relations is called current algebra [374]. It allows,
among other things, so-called sum rules to be derived, specific integrals over
differential cross sections, which relate to the properties of hadrons. Since they
do not make reference to specific dynamics, they stay valid also in a field 
theory which does have definite dynamics. We will come back to all of this 
in Chapter 9. But before that we have to go into details about experimental 
techniques for particle physics and how their development fuelled progress. 
CHAPTER 8


Enabling technologies 


The effect of a concept-driven revolution is to explain old things in 
new ways. The effect of a tool-driven revolution is to discover new things 
that have to be explained. In almost every branch of science ...there 
has been a preponderance of tool-driven revolutions. 


Freeman Dyson, Imagined Worlds, 1998 [505]¹


IN THIS CHAPTER I will discuss the role that particle acceleration and detection
technology plays in the development of the Standard Model of particle
physics. I will concentrate on successes (as usual) but exceptionally also cover
failed major projects, because there are interesting lessons to be learned.


8.1 COSMIC RAYS 


In the 1930s and 1940s quantum electrodynamics emerged as the only robust 
theory of particles and their interactions. It not only correctly describes the 
transitions in atoms, but the whole wealth of electromagnetic interactions 
between charged matter particles and photons. In the following two decades, 
faith in quantum field theory was lost, but the methodology of extracting and 
interpreting information from experiments was preserved. The main observ- 
ables were and still are cross sections and decay rates. 

During the same period of time, experimental techniques developed with 
impressive speed, as if physicists had only waited for technology to become 
available to turn it into new particle physics experiments. In this way, for
quite a while, experimental particle physics kept a considerable lead over
particle theory. It is arguable whether the development of disruptive theoretical
ideas has since driven the staggering progress in particle physics [362]
or whether it was progress in particle acceleration and detection techniques
which provided the driving force [543]. However, what can safely be said is that
their mutual stimulation brought about today’s situation where predictions


1 Copyright 1997 by the President and Fellows of Harvard College. Quoted by permission.
of theory are routinely verified or falsified by experiment with unprecedented
accuracy. In turn, the findings of experiments (or lack thereof) strongly constrain
the directions of further theory development.

As far as particle reactions are concerned, the only particle sources at
the beginning of the 20th century were radioactive decays and cosmic rays².
Charles-Augustin de Coulomb had already observed in 1785, using a torsion
electrometer, that gas-filled capacitors discharge slowly despite careful electrical
insulation. This phenomenon was called “air electricity” and put in
the same category as other electrical phenomena in the atmosphere, like lightning.
At the end of the 19th century, Julius Elster and Hans Geitel [643], two
German school teachers, friends and collaborators, showed that the discharge 
was due to the ionisation of air molecules in the capacitor volume, caused by 
charged particles. Around the same time, Becquerel discovered radioactivity 
(see Section 2.2). So the question was whether the charged particles came 
from Earth, from the atmosphere itself, or from outside. Research to answer 
this question was vigorously pursued by the Catholic priest Theodor Wulf 
SJ, using the sensitive and transportable string electrometer he had invented. 
This device is explained in Focus Box 8.1. 

Wulf’s plan was to measure the rate of discharge in an ionisation chamber 
as a function of altitude with respect to sea level. The rate is roughly 1 Volt 
per hour (see Focus Box 8.1), so the measurement takes patience and constant 
conditions. Measuring the rate at various places [130, 136], including the top 
of the Eiffel tower in Paris —at that time the world’s highest building— did 
not reveal a conclusive altitude dependence. Domenico Pacini and collaborators
[144] made similar measurements below the surface of Lake Bracciano and
concluded that the radiation was probably not emitted by the Earth’s
crust.

The proof for an extraterrestrial origin of the ionising radiation required 
balloon flights. The first ones were made in 1910 by Albert Gockel [132], a 
German working at the newly founded University of Fribourg, Switzerland. 
He used a balloon on loan from the Swiss Aero-Club in Zürich and a rather 
unreliable electrometer he had also borrowed. This was due to the fact that his 
research on air electricity was not at all supported by his boss, who thought 
that he should rather concentrate on solid state physics for the benefit of 
Swiss industry [587, 593], an argument against fundamental physics still heard 
today. The data collected during a short flight over Lake Zürich showed strong 
statistical and systematic variations. Nevertheless they confirmed that the 
radiation did not emerge from the Earth’s surface. Gockel was first to call it 
“cosmic radiation”, but missed a fundamental discovery. Two lessons are to 
be learned from his failure: if discovery is what you are after, use only first 
class instruments; and do not listen to your boss. 


2 The literature covering the history of cosmic rays is rather extensive, thanks to the
centenary of the discovery of their extraterrestrial origin in 2011. See e.g. the articles of the
special edition of Astroparticle Physics in 2014 [610].
An ionisation chamber is a closed volume filled with gas, normally air. Its
interior forms a capacitor, the voltage on its plates can be read from the 
outside using an electrometer, like the bifilar one invented by Theodor Wulf 
shown below [589]. Wulf patented the device in 1906 (DRP 181284). 




Credit: R.G.A. Fricke and K. Schlegel [589] 
A cut through the instrument is shown on the left. The lower end of thin metal 
filaments (F) was fixed to a quartz string (Q), using its elasticity as a restoring 
force. The string was mounted on an amber insulator (B). An external source 
can be connected to S and the case grounded. Two sodium dryers (Na) reduced 
the humidity on the inside of the device. The distance of the two filaments 
was measured through a microscope (M). A photograph of the instrument is 
shown on the right. It was manufactured by the Günther & Tegetmeyer (G&T)
company. A similar device [589], modified according to suggestions by Victor 
Hess, was used by him on his balloon flights. 

The rate of voltage decrease $dU/dt$ in the ionisation chamber measures the
rate of ionisation density $d\rho/dt$:

$$\frac{dU}{dt} = \frac{1}{C}\frac{dQ}{dt} = \frac{V}{C}\frac{d\rho}{dt}$$

With the capacitance $C$ and the volume $V$ of Wulf’s set-up, a voltage decrease
of 1 V/h corresponds to an ionisation rate of close to 1 ion/cm³/s.
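
The order of magnitude can be checked with a few lines of arithmetic. The values
of capacitance and volume used below (1 pF and 1.7 litres) are assumptions made
for illustration, not the documented parameters of Wulf’s instrument; the
function name is hypothetical and singly charged ions are assumed.

```python
E_CHARGE = 1.602176634e-19  # elementary charge in C


def ionisation_rate(dU_dt, capacitance, volume_cm3):
    """Ions per cm^3 and second for a voltage drop rate dU_dt (V/s), a
    capacitance (F) and a chamber volume (cm^3), assuming singly charged ions."""
    dQ_dt = capacitance * dU_dt              # charge neutralised per second, C/s
    return dQ_dt / (E_CHARGE * volume_cm3)   # ions / cm^3 / s


# Assumed illustrative set-up: C = 1 pF, V = 1700 cm^3, voltage drop of 1 V per hour
rate = ionisation_rate(dU_dt=1.0 / 3600.0, capacitance=1.0e-12, volume_cm3=1700.0)
print(f"ionisation rate = {rate:.2f} ions/cm^3/s")
```

With these assumed values the rate indeed comes out close to one ion per cubic
centimetre and second, which shows how small the charges are that the
electrometer has to resolve.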


Focus Box 8.1: Electrometers and ionisation chambers 




The experimental proof of the extraterrestrial origin of cosmic rays is thus 
rightly credited to the Austrian Victor Francis Hess [142]. In a series of long
balloon flights up to 5300 m altitude and with a first class electrometer
manufactured to his standards (see Focus Box 8.1), he measured the intensity
of the ionising radiation systematically. His rather precise data showed that
starting at an altitude of about 800 m the intensity increases quadratically
with altitude. His measurements were repeated with an even higher accuracy
by Werner Kolhörster [151] for altitudes reaching 8000 m. The extraterrestrial
origin of the radiation was thus established, the term cosmic rays generally 
accepted. In 1929, Walter Bothe and Kolhörster [230] demonstrated the influ- 
ence of the Earth’s magnetic field on the flux of cosmic rays. It was thus shown 
that they consist essentially of charged particles. 

For about four decades after their discovery, cosmic rays stayed the
dominant source of high energy particles. They cause particle showers by
interaction with the nuclei of the atmosphere, such that a variety of long 
lived particles reaches the Earth’s surface. Imaging particle detectors allow 
those to be visualised and identified. The first such detector was the cloud 
chamber, developed by the meteorologist C.T.R. Wilson [141, 145]. The track 
of an ionising particle is visualised by condensation of a saturated vapour. 
Wilson shared the 1927 Nobel prize for this invention with Arthur H. Comp- 
ton, the American physicist who discovered elastic photon-electron-scattering 
(see Focus Box 5.3). In 1932, Carl D. Anderson at Caltech [243], using a 
cloud chamber combined with a magnet to form a spectrometer, discovered 
the positron, the anti-electron predicted by Paul Dirac’s equation (see Focus 
Box 7.4). Figure 8.1 shows the spectrometer as well as the cloud chamber 
photo of a positron. Its charge +e is determined through the curvature of its 
track and its specific energy loss, measured through the density of droplets, as 
well as the energy lost when traversing a lead plate. The energy loss of charged 
particles in a medium by excitation and ionisation of atoms is explained in 
Focus Box 8.2. The year after, Giuseppe Occhialini and Patrick Blackett [253] 
at the Cavendish Laboratory of Cambridge confirmed the existence of this first 
antiparticle. Anderson and Hess shared the Nobel prize in physics of 1936 for 
their respective discoveries. 

The second important imaging detector of the time was the photographic 
emulsion, a glass plate covered by a light-sensitive layer. Similar to light on 
a photographic film, ionising radiation leaves a track of grains, which can 
be made visible by developing the emulsion. It can then be analysed with a 
microscope. The grain density is proportional to the specific energy loss by 
ionisation and thus proportional to the square of the particle charge. 

A non-imaging device for counting the passage of ionising particles was 
invented by Rutherford’s collaborator Hans Geiger in 1908 [126]. Its principle 
of operation and that of other gaseous detectors is explained in Focus Box 8.6. 
In its original version it worked only for the heavily ionising $\alpha$ particles with
charge $+2e$. When Geiger moved back to Germany, he and Walter Müller at the
University of Kiel perfected the device [226] such that it was also sensitive to
$\beta$ particles with a single charge $-e$, thus four times less ionisation. It consists
Charged particles in matter lose energy predominantly through multiple
interactions with atomic electrons. The energy lost by an incident particle
interacting with a single atomic electron can be calculated classically in analogy
to Rutherford scattering (see Focus Box 4.5), except that the recoil of the
struck electron is not at all negligible. In the early 1930s, Hans Bethe [233,
322] calculated the average energy $-dE$ lost by a projectile of charge $ze$, mass
$M \gg m_e$ and velocity $\beta = v/c$ in a thickness $dx$ of a material characterised
by its volume density $n_e$ of atomic electrons, according to quantum mechanics
and special relativity:

$$-\frac{dE}{dx} = \frac{e^4 n_e z^2}{m_e c^2\, 4\pi \epsilon_0^2\, \beta^2} \left[ \ln\frac{2 m_e c^2 \beta^2}{I(1-\beta^2)} - \beta^2 - \frac{\delta}{2} \right]$$


The term $\delta$, added later, corrects the result for the polarisation of material
atoms due to the incoming charge. This polarisation effectively reduces the
charge of distant atoms and thus the energy loss, i.e. $\delta > 0$. The ionisation
constant $I$ regroups the global ionisation properties of the atom, i.e. the
different excitation levels weighted by the probability to excite them. It depends
on the atomic charge $Z$ and is empirically found to be:

$$\frac{I}{Z} = \begin{cases} 12 + 7/Z \ \mathrm{eV}; & Z < 13 \\ 9.76 + 58.8\, Z^{-1.19} \ \mathrm{eV}; & Z \geq 13 \end{cases}$$

The energy loss per unit length is proportional to $z^2$; an $\alpha$ particle loses
four times the energy that a proton of the same velocity loses in the same
material. Thus measuring $dE/dx$ allows particles to be identified and the
absolute value of their charge to be measured. For non-relativistic projectiles,
the term $1/\beta^2$ dominates the velocity dependence. At the end of their range
in matter, just before they are stopped, charged particles thus lose the most
energy. This energy loss maximum is called the Bragg peak and used e.g. in
irradiating cancerous tissue. All particles have a minimum of their energy loss
at roughly the same momentum:

$$p/m = \beta\gamma \approx 3 \text{ to } 3.5\,; \qquad \left.-\frac{dE}{dx}\right|_{\min} \approx 1 \text{ to } 2\ \mathrm{MeV\,cm^2/g}$$

At high projectile energies, the energy loss increases logarithmically, $\propto \log(\beta\gamma)$.
The ionisation process is of course statistical in nature. The actual energy 
loss per unit material thickness follows a very asymmetric Landau distribu- 
tion [300] around its mean value given above. 
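
The numbers quoted above can be reproduced with a short numerical sketch of the
mean energy loss. It is written per unit mass, i.e. divided by the density, using
$n_e = N_A \rho Z/A$; the constant $K = 4\pi N_A r_e^2 m_e c^2 \approx 0.307$ MeV cm²/mol
collects the prefactor. The polarisation term $\delta$ is set to zero by default,
carbon is chosen as an example material, and all function names are hypothetical.

```python
import math

K_MEV_CM2_PER_MOL = 0.307075   # 4 pi N_A r_e^2 m_e c^2
ME_C2_EV = 0.51099895e6        # electron rest energy in eV


def ionisation_constant_ev(Z):
    """Empirical ionisation constant I (eV) as a function of atomic charge Z."""
    if Z < 13:
        return Z * (12.0 + 7.0 / Z)
    return Z * (9.76 + 58.8 * Z ** -1.19)


def mean_energy_loss(beta_gamma, z, Z, A, delta=0.0):
    """Mean energy loss -dE/dx per unit mass, in MeV cm^2/g, for a projectile
    of charge z in a material of atomic charge Z and mass number A."""
    beta2 = beta_gamma**2 / (1.0 + beta_gamma**2)
    # 2 m_e c^2 beta^2 / (I (1 - beta^2)) = 2 m_e c^2 (beta*gamma)^2 / I
    log_arg = 2.0 * ME_C2_EV * beta_gamma**2 / ionisation_constant_ev(Z)
    bracket = math.log(log_arg) - beta2 - delta / 2.0
    return K_MEV_CM2_PER_MOL * z**2 * (Z / A) / beta2 * bracket


def minimum_ionisation(Z, A, z=1):
    """Scan beta*gamma from 0.1 to 100 and return (position, value) of the minimum."""
    grid = [10.0 ** (-1.0 + 0.001 * i) for i in range(3001)]
    bg_min = min(grid, key=lambda bg: mean_energy_loss(bg, z, Z, A))
    return bg_min, mean_energy_loss(bg_min, z, Z, A)


if __name__ == "__main__":
    bg_min, loss_min = minimum_ionisation(Z=6, A=12.011)   # carbon, singly charged projectile
    print(f"minimum at beta*gamma = {bg_min:.2f}, -dE/dx = {loss_min:.2f} MeV cm^2/g")
    ratio = mean_energy_loss(1.0, 2, 6, 12.011) / mean_energy_loss(1.0, 1, 6, 12.011)
    print(f"alpha / proton energy loss at equal velocity = {ratio:.1f}")
```

Without the polarisation term the rise at high energies is overestimated, so the
sketch should only be trusted around and below the minimum.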


Focus Box 8.2: Specific energy loss by excitation and ionisation 
Figure 8.1 Left: Carl D. Anderson in front of his magnetic spectrometer for
cosmic rays. Right: A cloud chamber photograph of a positron with kinetic 
energy 63 MeV, entering from below, traversing a 6mm thick lead plate los- 
ing energy, and continuing with a curvature corresponding to 23 MeV [252]. 
(Credit: Archives, California Institute of Technology) 


of a gas-filled metallic tube, serving as an ionisation chamber and cathode,
with an axial anode wire strung in its centre and put under high voltage. When 
an ionising particle liberates electrons from the gas atoms, they are acceler- 
ated towards the anode. Since the field becomes very strong close to the thin 
anode wire, the drifting electrons gain momentum and can themselves ionise 
gas atoms. Thus an electron avalanche causes an appreciable current surge 
between anode and cathode. It can be made audible with a loudspeaker as a 
sharp snapping sound, and can be fed into an electronic circuit. The invention 
of the coincidence circuit for Geiger-Müller counters by Walther Bothe [234] 
and Bruno Rossi [392, p. 43f] enabled the systematic classification of particles 
in cosmic ray showers. In 1936 Carl Anderson and Seth Neddermeyer [269, 
273] on the American west coast, Jabez Street and Edward Stevenson [277] 
on the east coast, simultaneously discovered the muon. It is a brother particle
to the electron, but roughly 200 times heavier (see Section 9.1). Cecil Powell,
César Lattes and Giuseppe Occhialini in 1947 discovered the pion, the lightest
hadron³. The same year, Clifford Butler and George Rochester discovered the
kaon, the first member of the second generation of hadrons, which were called
“strange particles”.


8.2 PARTICLE ACCELERATORS 


Starting in the 1940s, accelerators⁴ gradually took over as sources of energetic
particles, to systematically produce new and unknown states and study


3 A hadron is a bound state of quarks, see Section 9.1.
4 A list of past and present accelerators for particle physics is offered by Wikipedia at
https://en.wikipedia.org/wiki/List_of_accelerators_in_particle_physics
their properties. With that, one no longer depended on anecdotal observation
of particles and their reactions, but could produce and study them under
controlled conditions. 

Accelerators were first developed in the early 1930s to push ahead nuclear 
research, which until then had been stuck with the use of α particles from
nuclear decay, with fixed energies of a few MeV and low rates. Accelerators 
produce beams of charged particles using the Lorentz force discussed in Focus 
Box 2.5. Electric fields are used to accelerate particles by increasing their 
momentum. Magnetic fields serve to deflect them from their original direc- 
tion. The simplest accelerator is the cathode ray tube (see Figure 2.5), where 
the accelerating structure is basically a capacitor. Static high voltages are 
generated from AC sources, like in the Cockcroft-Walton high voltage multi- 
plier [245, 246], invented in Rutherford’s Cavendish laboratory. Alternatively 
one can use mechanical transport of charges like in the Van de Graaff high
voltage generator [254], invented at Princeton. 

These inventions mark the beginning of the end of table-top experiments 
in nuclear and particle physics. The heavy and voluminous equipment hardly 
fitted into existing buildings. Careful engineering was required to make them 
work, in addition to good ideas. In the longer term this also led to a change
in the sociology of particle physics experiments. As detectors grew to cathedral
size with increasing energies, research groups have grown to several thousand
members today. The Second World War and the period immediately following
it saw the creation of large research centres, able to create and maintain the 
required infrastructure, like the National Laboratories in the U.S. and CERN 
in Europe. 

Both principles of high voltage generation for low currents are still used
today to accelerate protons and ions; many demonstration experiments also
use them. The reachable energy, given by the product of particle charge q 
and accelerating voltage V, E = qV, is limited by the stability of the high 
voltage insulation. Thus, it does not exceed about 25 MeV. To reach a higher 
energy requires that the projectile passes several times through an accelerating 
potential, like in a circular or linear accelerator. 
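
As a minimal numerical sketch of the relation E = qV, assuming energies in MeV and voltages in MV so that the elementary charge drops out, the following Python snippet compares a single passage through a large static voltage with many passages through a modest one. The function name and the example voltages are illustrative choices, not values from a particular machine.

```python
def energy_gain_mev(charge_number, voltage_mv, passes=1):
    """Energy gained in MeV by a particle of charge (charge_number * e)
    that crosses an accelerating voltage (in MV) 'passes' times: E = n q V."""
    return passes * charge_number * voltage_mv


# Single passage of a proton through 25 MV, the rough insulation limit quoted in the text.
single_pass = energy_gain_mev(charge_number=1, voltage_mv=25.0)

# Hypothetical example: 1000 passages through a modest 0.1 MV gap.
repeated_pass = energy_gain_mev(charge_number=1, voltage_mv=0.1, passes=1000)

print(f"single passage:    {single_pass:.1f} MeV")
print(f"repeated passages: {repeated_pass:.1f} MeV")
```

The second case illustrates why machines that send the projectile repeatedly through a moderate voltage are not bound by the insulation limit.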

The simplest circular accelerator is the cyclotron. The physics of this and 
other circular accelerators is explained in Focus Box 8.3. The cyclotron makes 
use of the fact that for non-relativistic velocities, the revolution frequency 
of a charged particle, the cyclotron frequency, is independent of energy and 
only depends on the magnetic field. As early as 1929, the 28-year-old professor
Ernest O. Lawrence, who had just joined the University of California’s Berkeley
campus, developed and built a first cyclotron. Lawrence patented this inven- 
tion in 1934 as U.S. Patent 1,948,384 and obtained the Nobel prize in 1939. 
That same year, Berkeley had at its disposal the world’s highest energy accelerator,
the 60-inch cyclotron shown in Figure 8.2. 

As the particle velocity approaches the speed of light, it depends less and 
less on the particle momentum or energy. This asymptotic saturation towards 
the speed of light is very useful, provided that the radius of curvature is 
[Figure: sketch of a cyclotron. Labels: “Magnetic field bends path of charged particle”, “accelerates”.]

The simplest circular accelerator is the cyclotron. In a uniform and constant
magnetic field B, a particle of charge e moves on a circle of radius
$R = p/(eB)$, where p is its momentum (see Focus Box 2.1). The angular
frequency of this movement is called the cyclotron frequency, $\omega_c = (e/m)B$.

In the non-relativistic domain, this frequency is constant because the
circumference of the orbit increases in proportion to the velocity of the particle.
The particle, typically a proton, is accelerated by the electric field present
between the two D-shaped cavities. This field is provided by a radio frequency
generator such that its frequency is an integer multiple of the cyclotron
frequency. The cyclotron has its limits in that the velocity of the particle
does not remain proportional to the momentum, but asymptotically approaches the
speed of light, $v/c = \sqrt{1 - m^2/E^2}$. In the relativistic limit, the velocity
barely increases despite a steady increase in energy. Acceleration radio frequency
and particle revolution frequency thus quickly fall out of phase in a cyclotron,
at an energy of some tens of MeV for protons. One must then adjust the radio
frequency to the relativistic velocity.

Alternatively, one can keep the field B proportional to the momentum, as in 
the synchrotron. In that case, one can work with a constant accelerating radio 
frequency. The RF field is transmitted to the beam by resonant cavities. The 
synchrotron principle requires a certain initial velocity sufficiently close to 
the speed of light. The beam is thus usually pre-accelerated before injection, 
typically by a linear accelerator. The acceleration process must stop when one 
reaches the maximum field of the dipole magnets. One then either extracts the 
beam or converts the accelerator into a storage ring. In this latter operational 
mode, radio frequency provides at each turn just the energy lost by the beam
via bremsstrahlung.

The separation of functions and the concentration of the components around the
ring allows the combining of two rings to construct what is called a collider,
originally invented by the Norwegian Rolf Widerøe working in war-time Germany
[442]. A historical example is the Princeton-Stanford storage ring [385, 408].
It worked with electron beams in the two rings which intersect in the middle,
where the two storage rings touch.
The detector, a set of spark chambers (see Section 8.3), determined the scat- 
tering angle. Obviously, one can also fold the two accelerators on top of each 
other, or even combine them to collide particles and antiparticles. 


Focus Box 8.3: Circular particle accelerators 
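
To put numbers on the relations of Focus Box 8.3, here is a small Python sketch for protons. It assumes a field of 1 T, which is an illustrative value, and uses the standard relativistic result that the revolution frequency is the cyclotron frequency divided by γ = E/m; the function names are hypothetical.

```python
import math

E_CHARGE_C = 1.602176634e-19      # elementary charge in coulomb
PROTON_MASS_KG = 1.67262192e-27   # proton mass in kg
PROTON_MASS_MEV = 938.272         # proton rest energy in MeV


def cyclotron_frequency_hz(b_tesla):
    """Non-relativistic proton revolution frequency, f = e B / (2 pi m)."""
    return E_CHARGE_C * b_tesla / (2.0 * math.pi * PROTON_MASS_KG)


def velocity_over_c(kinetic_mev):
    """v/c = sqrt(1 - m^2/E^2), with total energy E = kinetic energy + m."""
    total = kinetic_mev + PROTON_MASS_MEV
    return math.sqrt(1.0 - (PROTON_MASS_MEV / total) ** 2)


def revolution_frequency_hz(b_tesla, kinetic_mev):
    """Proton revolution frequency including the factor gamma = E/m."""
    gamma = (kinetic_mev + PROTON_MASS_MEV) / PROTON_MASS_MEV
    return cyclotron_frequency_hz(b_tesla) / gamma


f0 = cyclotron_frequency_hz(1.0)  # assumed field of 1 T
print(f"cyclotron frequency at 1 T: {f0 / 1e6:.2f} MHz")
for t in (1.0, 20.0, 100.0):
    shift = 1.0 - revolution_frequency_hz(1.0, t) / f0
    print(f"T = {t:6.1f} MeV: v/c = {velocity_over_c(t):.3f}, "
          f"frequency lower by {shift:.1%}")
```

Already at some tens of MeV the revolution frequency has dropped by a few percent, enough for the beam to drift out of phase with a fixed radio frequency over many turns.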
Figure 8.2 Left: The 60-inch cyclotron at Berkeley’s Rad Lab in 1934. Ernest
Lawrence is second from the left (Credit: Science Museum London/Science 
and Society Picture Library). Right: The 600-MeV Synchrocyclotron (SC), 
CERN’s first accelerator, came into operation in 1957 (Credit: CERN). 


held constant. In this case, the rotational frequency of the beam is again 
independent of energy. This is the principle of the synchrotron. By increasing 
the magnetic field proportional to the momentum of the particle, the radius 
of curvature remains constant. One does not have to fill a large volume with a 
magnetic field, but can concentrate it inside a vacuum chamber, whose shape 
approximates a circle. The synchrotron was invented by Vladimir Veksler [301] 
and independently by Edwin McMillan [304]. When particles are injected on 
the rising flank of the accelerating electric field, they gather automatically into 
synchronous bunches: latecomers see more field, early ones less. The phase of
the circulating particles with respect to the radio frequency is thus stabilised. 

In 1954, the University of California at Berkeley started operating the
Bevatron [335], built again under the leadership of Ernest Lawrence. This proton
accelerator, a synchrotron shown in Figure 8.3, reached a maximum
energy of 6.2 GeV, but with a wide beam of four square feet cross section.
Its name comes from the energy “billion electron volts” (10⁹ eV, today called
giga electron volts, GeV). Its maximum energy was high enough to produce
proton-antiproton pairs from a hydrogen target at rest. It was thus the first
accelerator explicitly built to test a theoretical prediction, namely that each
particle has an antiparticle with identical mass but opposite charge. For the
electron, Anderson had already shown with cosmic rays that this was true; for
baryons, i.e. nuclear building blocks, the proof required accelerators.
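
The statement that the Bevatron energy sufficed for antiproton production can be checked with a short calculation. The sketch below assumes the reaction p + p → p + p + p + p̄ on a free proton at rest and uses the fixed target relation s = m² + M² + 2EM from Focus Box 8.4. The function name is hypothetical, and the 6.2 GeV is taken here to be the kinetic energy of the beam.

```python
PROTON_MASS_GEV = 0.938272  # proton rest energy in GeV


def threshold_kinetic_energy(final_mass_sum, projectile_mass, target_mass):
    """Minimum kinetic energy of the projectile (fixed target at rest) such that
    s = m^2 + M^2 + 2 E M reaches the square of the summed final state masses."""
    s_min = final_mass_sum ** 2
    total_energy = (s_min - projectile_mass ** 2 - target_mass ** 2) / (2.0 * target_mass)
    return total_energy - projectile_mass


# Three protons and one antiproton in the final state: four proton masses.
t_min = threshold_kinetic_energy(4 * PROTON_MASS_GEV, PROTON_MASS_GEV, PROTON_MASS_GEV)
print(f"threshold kinetic energy: {t_min:.2f} GeV")
print("Bevatron above threshold:", 6.2 > t_min)
```

The threshold comes out at six proton masses, about 5.6 GeV, just below the maximum energy quoted for the Bevatron.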

The transverse beam size can be dramatically reduced by magnetic 
lenses. Quadrupole and sextupole magnets are used for what is called strong 
focussing, to distinguish it from the by-product of weak focussing by dipole 
magnets. The first strongly focussing accelerator with much smaller beam size 
and thus higher luminosity (see Section 7.3) was the Alternating Gradient 
Figure 8.3 Left: The Berkeley Bevatron in 1958 (Credit: The Regents of the
University of California, Lawrence Berkeley National Laboratory, image used 
by permission). Right: A hydrogen bubble chamber built by the Berkeley 
group and NIST. The proton beam crosses the bathtub-shaped chamber body 
horizontally. The cameras are placed on the conical top. The whole chamber 
is immersed in a cryostat for cooling. Its cover is seen at the top. (Credit: 
National Institute of Standards and Technology). 


Synchrotron (AGS)⁵ at the Brookhaven National Laboratory, which reached
a proton energy of 33 GeV in 1960. Next to energy, luminosity is the second 
important figure of merit for an accelerator. It is the proportionality factor 
relating the reaction rate to the cross section. The higher the luminosity, the 
higher the observable rate of a given reaction. 
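
The proportionality between reaction rate and cross section can be made concrete with a few lines of Python. The luminosity and cross section used below are illustrative assumptions, not figures from the text, and the function name is hypothetical.

```python
BARN_IN_CM2 = 1e-24  # 1 barn = 1e-24 cm^2


def event_rate_hz(luminosity_cm2_s, cross_section_barn):
    """Reaction rate = luminosity x cross section.
    Luminosity in cm^-2 s^-1, cross section in barn, result in events per second."""
    return luminosity_cm2_s * cross_section_barn * BARN_IN_CM2


# Assumed example: luminosity of 1e34 cm^-2 s^-1 and a cross section of 1 nanobarn.
rate = event_rate_hz(1e34, 1e-9)
print(f"rate: {rate:.1f} events per second")
```

Doubling the luminosity doubles the observable rate for any given reaction, which is why it ranks next to energy as a figure of merit.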

The first accelerator at the European centre for particle physics CERN, 
the Synchrocyclotron shown in Figure 8.2, started its regular service to exper- 
iments in 1957. A bit later followed the strong focussing Proton Synchrotron. 
Since then CERN has built and operates a whole series of accelerators⁶,
schematically shown in Figure 8.5, with steadily increasing energy and lumi- 
nosity, such that today it is the leading particle physics laboratory in the 
world. 

When an accelerator is used in what is called a fixed target experiment, a 
beam of particles is extracted and steered towards a stationary target where 
it interacts. The target is followed or surrounded by detectors to measure the 
final state. Alternatively, in a storage ring or collider, two particle beams are 
brought to collision, usually in the centre-of-mass frame of the two particles. The
advantage is that no collision energy is lost in the recoil of the target, thus 
the whole centre-of-mass energy of projectile and target is available for the 
reaction. If both beams have the same energy, that centre-of-mass energy is 
simply twice the beam energy, whereas it is much smaller in a fixed target set- 
up, as shown in Focus Box 8.4. Recent examples of colliders are the Tevatron 
at the Fermi National Laboratory (FNAL) close to Chicago in the U.S., which 


⁵ See https://www.bnl.gov/about/history/accelerators.php
⁶ See https://home.cern/science/accelerators/accelerator-complex
collided protons and antiprotons until 2011 and the Large Electron Positron
collider, LEP, at CERN, which collided electrons and positrons. It was in 
operation between 1989 and 2000 in the tunnel which now houses the Large 
Hadron Collider, LHC, up to now the world’s highest energy collider. 


We compare the kinematics of a fixed target and a collider experiment.
In a fixed target experiment, a projectile of mass m and energy-momentum
$(E, \vec{p})$ interacts with a target of mass M at rest, with energy-momentum
$(M, 0, 0, 0)$. The total energy-momentum four-vector of the system is thus:

$$ (E + M, \vec{p}) $$

This four-vector is conserved in the collision, such that the total mass squared
of the final state, s, is:

$$ s = (E + M)^2 - \vec{p}^{\,2} = m^2 + M^2 + 2EM \simeq 2EM $$

The last approximation holds for $E \gg m, M$. To give a numerical example, if
an electron of 100 GeV energy impacts on a proton, the total mass, $\sqrt{s}$, which
can be created in the final state is about 14 GeV.

In a collision in the centre-of-mass system, the two incoming particles have
four-momenta $(E, \vec{p})$ and $(E', -\vec{p})$. The total four-vector and the total
mass squared are thus:

$$ (E + E', \vec{0}) \; ; \quad s = (E + E')^2 $$

For two particles of 100 GeV energy impacting on each other, the final state
mass $\sqrt{s}$ is thus simply 200 GeV.


Focus Box 8.4: Centre-of-mass energy 
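
The two numerical examples of Focus Box 8.4 can be reproduced with the following Python sketch. It uses the exact fixed target expression s = m² + M² + 2EM and, for the collider, the centre-of-mass relation s = (E + E')²; the function names are hypothetical and the masses are rounded standard values.

```python
import math

ELECTRON_MASS_GEV = 0.000511
PROTON_MASS_GEV = 0.938272


def sqrt_s_fixed_target(beam_energy, beam_mass, target_mass):
    """Centre-of-mass energy for a projectile of total energy beam_energy
    hitting a target at rest: s = m^2 + M^2 + 2 E M."""
    return math.sqrt(beam_mass ** 2 + target_mass ** 2 + 2.0 * beam_energy * target_mass)


def sqrt_s_collider(energy_1, energy_2):
    """Centre-of-mass energy for two beams with opposite momenta of equal size
    (centre-of-mass frame): s = (E + E')^2."""
    return energy_1 + energy_2


fixed = sqrt_s_fixed_target(100.0, ELECTRON_MASS_GEV, PROTON_MASS_GEV)
collider = sqrt_s_collider(100.0, 100.0)
print(f"fixed target: sqrt(s) = {fixed:.1f} GeV")
print(f"collider:     sqrt(s) = {collider:.1f} GeV")
```

Because the fixed target result grows only with the square root of the beam energy, a hundredfold increase in beam energy yields only a tenfold increase in available mass.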


Figure 8.4 shows that the centre-of-mass energy⁷ reached by accelerators
and storage rings increased roughly exponentially with time. The most powerful
storage ring to date, the Large Hadron Collider (LHC)⁸ of CERN, falls
short of such a straightforward extrapolation from previous machines. However,
its energy, currently 13 TeV and probably 14 TeV starting in 2021,
is about 100,000 times higher than what the 60-inch cyclotron of Berkeley
reached in proton-proton collisions 80 years earlier. In operation since 2008,
the LHC proton-proton collider will dominate experimental particle physics for 
decades to come. Figure 8.5 shows the current complex of accelerators which 
CERN is operating. Planning for even larger machines is going on under the 


⁷ For better comparability, the diagram uses the mean quark energy and not the mean
proton energy for hadron machines.
⁸ See https://home.cern/science/accelerators/large-hadron-collider
auspices of the European Committee for Future Accelerators, ECFA⁹. CERN
has become a world centre for accelerator based particle physics, with many 
contributions and contributors from outside Europe. A study is under way 
to assess the feasibility and physics of a giant circular collider at CERN, the 
Future Circular Collider¹⁰, with collision energies ten times higher than the
LHC. 

Since light charged particles lose energy by bremsstrahlung when their 
trajectory is bent, linear accelerators are interesting alternatives to circular 
ones, especially for high energy electron beams. In fact, accelerator complexes 
all start with a linear accelerator right after the particle source, feeding beams 
into higher energy acceleration stages. The physics of linear accelerators is 
explained in Focus Box 8.5. A pioneer of this technology is the Stanford Linear 
Accelerator Center (SLAC), where major contributions to the discovery of 
quarks were made by colliding high energy electron beams with protons and
neutrons (see Section 9.1). While it is straightforward to direct the beam from
a linear accelerator towards a stationary target, constructing a linear collider
is rather complex. So far the only example, the Stanford Linear Collider, was
also housed at SLAC. Plans also exist to build a major facility using colliding
beams from linear accelerators¹¹.

Astrophysical phenomena can accelerate particles to energies which are 
many orders of magnitude higher than those man-made accelerators can ever 
hope to achieve. The highest energy particles observed so far reach beyond 10²⁰
eV. They are, however, extremely rare. Their flux near Earth is as low as one
particle per square kilometre per century. Cosmic rays, photons and neutrinos 
have become a subject of intense study again, in the emerging field of astro- 
particle physics. Because of their low fluxes at high energies, they are no longer 


⁹ See https://ecfa.web.cern.ch
¹⁰ See https://home.cern/science/accelerators/future-circular-collider
¹¹ See https://ilchome.web.cern.ch and http://clic-study.web.cern.ch
In a circular accelerator of radius ρ, the power P lost at each turn is $P \propto \gamma^4/\rho$
with the relativistic factor $\gamma = E/m$, the particle energy E and mass m. The
power lost is thus very much larger for electrons than for protons at the same
radius and energy. One can reduce this loss by increasing the curvature
radius ρ, i.e. by decreasing the bending field of the dipole magnets.


The limit $\rho \to \infty$ gives a linear accelerator, sketched in the
accompanying figure. Obviously, in contrast to circular machines, the beam
passes through each accelerating structure only once.

[Figure: sketch of a linear accelerator. Labels: ion source, RF oscillator, copper envelope.]


A large part of the vacuum tube itself forms an accelerating radiofrequency
structure, through which an electromagnetic wave passes. Between the resonating
cavities, there are field-free spacings realised by conducting tubes. The length $l$
of these spacings must follow the velocity v of the accelerated beam, such that
$l = vT/2$, where T is the period of the radiofrequency wave. The electric field
in the cavities between tubes must obviously also have the required strength
and direction. Once the particle velocity approaches the speed of light,
$l$ becomes constant.

The particles are efficiently accelerated in this wave-guide structure, provided 
that: 


• the electric field has a large longitudinal component along the beam
  direction and small transverse components;

• the phase velocity of the wave is roughly equal to that of the particles
  in the beam.


Since particles pass through the structure only once, one must aim for very 
high values of the electric field. This is usually done with superconducting 
cavities, which can reach electric fields of the order of 25 MV/m. A large 
accelerator nevertheless needs thousands of accelerating structures and cannot 
be too compact. 


Focus Box 8.5: Linear accelerators 
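
The relation l = vT/2 for the field-free spacings can be illustrated with a short Python sketch for protons. The radio frequency of 200 MHz and the list of kinetic energies are assumptions chosen for illustration, and the function names are hypothetical.

```python
import math

C_LIGHT_M_S = 299_792_458.0   # speed of light in m/s
PROTON_MASS_MEV = 938.272     # proton rest energy in MeV


def velocity_over_c(kinetic_mev):
    """v/c from gamma = 1 + T/m."""
    gamma = 1.0 + kinetic_mev / PROTON_MASS_MEV
    return math.sqrt(1.0 - 1.0 / gamma ** 2)


def drift_tube_length_m(kinetic_mev, rf_hz):
    """l = v T / 2, with T = 1 / rf_hz the period of the radiofrequency wave."""
    return velocity_over_c(kinetic_mev) * C_LIGHT_M_S / (2.0 * rf_hz)


RF_HZ = 200e6  # assumed radio frequency
energies_mev = (1.0, 10.0, 100.0, 10_000.0)
lengths = [drift_tube_length_m(t, RF_HZ) for t in energies_mev]
limit = C_LIGHT_M_S / (2.0 * RF_HZ)  # length reached as v approaches c

for t, length in zip(energies_mev, lengths):
    print(f"T = {t:8.1f} MeV: l = {length * 100:6.1f} cm")
print(f"limit for v -> c: {limit * 100:.1f} cm")
```

The spacings grow quickly at low energy and then saturate at c T/2, in line with the remark that l becomes constant for relativistic particles.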
Figure 8.5 The CERN accelerator complex. Machine acronyms are indicated
and explained in the legend. Acronyms of the four LHC experiments are also 
shown. Almost all machines built in over 60 years are still used, the smaller 
ones feeding pre-accelerated beams into the larger ones. (Credit: CERN) 


used as a particle source. Instead, they carry important information about 
their astrophysical and cosmological sources and the accelerating mechanisms, 
which are able to reach these fantastic energies. More than a hundred years 
after their discovery, the production, acceleration and transport processes of 
cosmic rays are still mostly unknown and in the focus of experiments on Earth 
and in space (see Section 10.2). 


8.3 PARTICLE DETECTORS 


In parallel to the fast-paced development of accelerator technology, methods to
detect and count particles, and to measure their properties, have made impressive
progress. In the beginning of the 20th century, particle detection was mostly 
about observing radioactive decay and cosmic rays, in addition to atomic
spectroscopy. The counting rate of the Geiger and Marsden experiments, scattering
α particles off nuclei, never exceeded a few Hertz, and could be determined by
observing a phosphorescent screen by eye. As far as the discovery of new parti- 
cles was concerned, imaging techniques, like cloud chambers and photographic 
emulsions, played a crucial role. These devices not only visualise the impact of 
an ionising particle, as the phosphorescent screen does, but allow one to follow its
trajectory. Measuring the range and specific energy loss of charged particles
(see Focus Box 8.2) together with their curvature in a magnetic field allows one to
estimate the mass and charge of particles. Yet the observations stayed anecdotal;
a systematic measurement of particle properties and their reactions with
significant statistics was not feasible with imaging techniques alone. 

That changed with the invention of electronic methods of particle detec- 
tion, pioneered by Hans Geiger and Walther Müller, and especially with the 
electronic registration of their signals via the coincidence method invented by 
Geiger, Bothe and Rossi (see Section 8.1 above). This is an electric circuit, 
using vacuum tubes at the time, which compares the time of arrival of two 
or more signals from detection devices, and triggers a counting or measuring 
process if they are coincident within a defined window. The method allows,
e.g., charged particle spectroscopy by detecting particles before and after an
absorber, which defines an energy threshold. For imaging detectors, it allows
triggering a camera whenever an interesting particle traverses the chamber. 
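
The logic of a coincidence circuit can be mimicked in software. The following Python sketch is a modern analogue, not a description of the original vacuum tube electronics: it takes two lists of arrival times from two hypothetical counters and counts the pairs that fall within a chosen time window. The function name, the time unit and the example values are assumptions.

```python
def count_coincidences(times_a, times_b, window):
    """Count pairs of signals, one from each counter, whose arrival times
    differ by at most 'window'. Each signal is used at most once."""
    a = sorted(times_a)
    b = sorted(times_b)
    i = j = matches = 0
    while i < len(a) and j < len(b):
        dt = a[i] - b[j]
        if abs(dt) <= window:
            matches += 1      # coincident pair found, advance both counters
            i += 1
            j += 1
        elif dt < 0:
            i += 1            # signal in counter A has no partner, skip it
        else:
            j += 1            # signal in counter B has no partner, skip it
    return matches


# Invented arrival times in microseconds for two counters.
counter_a = [1.0, 5.0, 9.0, 14.0]
counter_b = [1.2, 7.0, 9.1, 20.0]
n_coincident = count_coincidences(counter_a, counter_b, window=0.5)
print("coincidences:", n_coincident)
```

Only the signals near 1 and 9 microseconds have partners in both counters, so uncorrelated hits in a single counter are suppressed.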

An important advance in imaging detectors was the invention in the 1950s
of the bubble chamber by Donald A. Glaser [320] at Ann Arbor, Michigan, 
working with his advisor Carl Anderson. Here a liquid is overheated under 
pressure. At the passage of particles, the pressure is released. Along the track 
of an ionising particle small bubbles form within microseconds, due to the 
local heating by ionisation. They are photographed using flash illumination. 
Pressure is then restored to be ready for the next passage. Since the thermo- 
dynamic process as well as pressurisation and release are not very rapid, this 
technique is limited to rates of the order of a Hertz. But it images particle 
reactions with rich detail. When the chamber is immersed in a magnetic field, 
it forms a spectrometer. The track curvature measures the particle momentum.
The bubble density along the track measures the specific energy loss
and allows one to distinguish between light and heavy particles. Even photons,
which do not themselves leave a track, can be observed if they convert into 
electron-positron pairs. Last but not least, bubble chamber pictures give an 
immediate visual impression of particle reactions. I for one was attracted to 
particle physics by precisely this feature. 
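
How the track curvature translates into momentum follows from R = p/(eB) in Focus Box 8.3. In practical units this becomes p [GeV/c] ≈ 0.3 · B [T] · R [m] for a singly charged particle moving perpendicular to the field. The Python sketch below evaluates it; the function name and the example radius are assumptions.

```python
C_LIGHT_M_S = 299_792_458.0  # speed of light in m/s


def momentum_gev(b_tesla, radius_m, charge_number=1):
    """Momentum component transverse to the field, p = q B R, in GeV/c.
    Numerically p = 0.2998 * |z| * B[T] * R[m]."""
    return C_LIGHT_M_S * 1e-9 * abs(charge_number) * b_tesla * radius_m


# Assumed example: a track with 1 m radius of curvature in a 2 T field.
p = momentum_gev(b_tesla=2.0, radius_m=1.0)
print(f"p = {p:.2f} GeV/c")
```

The stronger the field, the more a track of given momentum is bent, which is why large chambers were immersed in fields of a tesla or more.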

In parallel to the accelerator development at Berkeley after World War 
II, the group of Luis Alvarez developed larger and larger bubble chambers 
filled with liquid hydrogen. One of their chambers, in the process of being 
inserted into its cooling dewar, is shown in Figure 8.3. Together with the 
Bevatron beam, the bubble chamber technique led to the discovery of a large 
number of hadrons and hadronic resonances. Both Glaser (1960) and Alvarez 
(1968) obtained the Nobel prize in physics for their important contributions 
to particle detection technology. The pictures of a bubble chamber had to be 
scanned by eye for interesting reactions. The particle tracks were digitised by 
hand and results typically stored on punched cards, later to be analysed by 
computers. 
Luis Alvarez was a prolific inventor with no less than 17 U.S. patents.
During the war he contributed to military efforts with the development of
radar and an important contribution to the Manhattan Project (see Chapter 6).
His interests ranged from particle physics to dinosaur extinction, Egyptology 
and aviation. 

Following the creation of intense proton beams at the CERN Proton Synchrotron
and the rising interest in neutrino physics, the need arose for a massive
detector capable of detecting very rare neutrino interactions [484]. This led to
the proposal by André Lagarrigue of École Polytechnique in Orsay, France, to
build a large bubble chamber filled with a heavy liquid like freon. The result
was the heavy liquid bubble chamber Gargamelle, named after a character
from the Renaissance fantasy novels by François Rabelais. It was almost 5m
long and 2m in diameter, filled with liquid freon or propane, and immersed in a
2T magnetic field. The interior was viewed by eight cameras providing stereo- 
scopic images of particle reactions. A photograph of the set-up is shown in 
Figure 8.6. It was operated with neutrino and antineutrino beams from 1970 
to 1978. Until the 1980s, pictures from Gargamelle and its successor BEBC 
(1971 to 1984) were analysed by technical personnel and physicists using pro- 
jection tables like the one shown in Figure 8.6. Again, particle tracks were 
digitised by hand in projection, then reconstructed in 3D using sophisticated 
computer programs. The results were stored on punched cards or magnetic 
tape for further analysis. The most important result from Gargamelle was the 
discovery of weak neutral current interactions, discussed in Section 9.2. 

More target mass, but less spatial resolution and detail, is provided by
another imaging detector, the spark chamber invented by Shuji Fukui and 
Sigenori Miyamoto in the late 1950s [343, 350]. It consists of a series of parallel 
plate capacitors, filled with a gas mixture, typically on the basis of noble gases. 
A high voltage is applied to the metal plates with alternating polarity. When 
an ionising particle passes, a localised spark develops marking its position in 
every gap. The spark pattern can be photographed; it gives a coarse graphic
representation of a reaction, which typically happens in one of the metal plates. 
The plates can be made of high density material like iron or lead, such that 
the target mass presented to the incoming beam is substantial. Therefore, 
spark chambers were preferentially used in experiments to observe small cross 
section reactions, like early neutrino experiments in Brookhaven and CERN. 
Spark chambers are still often used today to demonstrate the passage of cosmic 
rays. 

The breakthrough towards digital technologies started with the invention 
of the multi-wire proportional counter by Georges Charpak (Nobel prize in
1992, Figure 8.7). The principles of operation of gas filled particle detectors 
are explained in Focus Box 8.6. In a proportional chamber, densely strung 
thin anode wires are set to high voltages, such that when an ionising parti- 
cle passes, a cloud of electrons is created near the anode, but in contrast to a 
Geiger-Müller counter no break-down occurs. Under the influence of the strong 
electric field near the wires, electrons are accelerated towards the anode and 
Figure 8.6 Left: The CERN bubble chamber Gargamelle. The large Helmholtz
coils (white) generate a strong magnetic field orthogonal to the chamber axis, 
which is oriented along the direction of the incoming beam. The inside of the 
chamber body is lit by flash lights and viewed by eight cameras. Right: A pro- 
jection table constructed by Saab in Sweden, projecting Gargamelle pictures 
on a table for scanning and digitisation. The tracks of final state particles are 
followed by hand and digitised. A computer program then reconstructs the 
reaction products in three dimensions. (Credit: CERN Courier, June 1973) 


cause an avalanche. An electrical signal is thus induced which can be amplified
and digitised. This allows one to track the particle trajectory, with subsequent
wire-by-wire measurements. The signal height is proportional to the specific
energy loss of the particle, explaining the name proportional chamber, and
allows one to determine the absolute value of the particle charge. With this
invention and the development of sensitive analog-to-digital converters, the step
from analog to digital particle detection techniques was thus taken.

Even more precise localisation of particle tracks is achieved by semicon- 
ductor technology, which has largely taken over from gaseous detectors since 
the later part of the 20th century (see Focus Box 8.7). Thin silicon wafers, 
similar to computer chips, are doped to form diodes, with a potential barrier 


Figure 8.7 Georges Charpak, Fabio Sauli and Jean-Claude Santiard (left
to right) testing a multiwire proportional chamber at CERN in 1970.
(Credit: CERN Courier, November 2010)
To recover electrons and ions before they recombine, an electric field
collects them on the anode or cathode, respectively. The arriving charges
cause a short current surge, detected after suitable amplification. The number
of electron-ion pairs created is $N_i = -(dE/dx)\, d/W$, where dE/dx is given by
the Bethe-Bloch formula (Focus Box 8.2), d is the thickness of gas traversed
and W is the mean ionisation energy required for a single pair; in a gas W ≈ 30 eV.
In the proportional region, the signal is amplified by an ionisation avalanche
close to the anode, still proportional to the number of ionisation pairs but
amplified by a factor of 10⁴ to 10⁶. The signal is thus large, but subject to
variations in amplification due to high voltage, temperature and other envi- 
ronmental factors. 
When the high voltage is further increased, in the Geiger-Müller region,
the acceleration of primary electrons increases rapidly and they excite an 
avalanche early on. In addition, a large number of photons is produced in 
the de-excitation of ionised atoms. They also contribute to the avalanche by 
photo-electric effect. A discharge is caused giving a large electrical signal on 
the anode. It can be made audible as a sharp snap. Following the discharge, 
the counter is insensitive during what is called its dead time. 


Focus Box 8.6: Gaseous particle detectors 
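
The pair-counting formula of Focus Box 8.6 lends itself to a quick estimate. The Python sketch below treats the energy loss as a positive number, so the minus sign of the formula is absorbed. The energy loss of 2.5 keV per centimetre and the amplification factor are illustrative assumptions; only W ≈ 30 eV is taken from the text.

```python
E_CHARGE_C = 1.602176634e-19  # elementary charge in coulomb


def primary_pairs(energy_loss_ev_per_cm, thickness_cm, w_ev=30.0):
    """Number of electron-ion pairs, N = (energy loss per length) * d / W."""
    return energy_loss_ev_per_cm * thickness_cm / w_ev


def collected_charge_c(n_pairs, gain=1.0):
    """Charge arriving at the anode after gas amplification by 'gain'."""
    return n_pairs * gain * E_CHARGE_C


# Assumed example: 2.5 keV/cm energy loss over 1 cm of gas.
n_pairs = primary_pairs(energy_loss_ev_per_cm=2500.0, thickness_cm=1.0)
charge = collected_charge_c(n_pairs, gain=1e5)  # assumed amplification factor
print(f"primary pairs: {n_pairs:.0f}")
print(f"collected charge with amplification: {charge:.2e} C")
```

Fewer than a hundred primary pairs per centimetre explain why amplification in the gas, or in the subsequent electronics, is indispensable.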
in the junction zone. By applying a modest external voltage, this zone can
be enlarged in a controlled way. When an ionising particle crosses, electrons 
are lifted from the valence to the conduction band and a short current surge 
passes the diode. The energy necessary to create an electron-hole pair in the 
semiconductor is ten times less than the ionisation energy of a typical gas 
atom. The signal is thus larger at comparable ionisation energy loss. It is also 
very localised, such that a structured metallisation of the surface allows spa- 
tial resolutions of the order of micrometers. Tracking devices together with a 
transverse magnetic field form a spectrometer, a detector to measure the par- 
ticle momentum. Since the time integral of the signal current is proportional 
to the specific ionisation dE/dx, spectrometers can also contribute to particle 
identification. 

A fundamentally different approach to the measurement of particle energy 
is the calorimeter. Its idea is to slow down and absorb particles completely, 
converting their energy loss into an electrical signal. Homogeneous materi- 
als which scintillate are an obvious choice. Scintillators are transparent inor- 
ganic or organic materials, which convert ionisation energy loss into light, 
with wavelengths often in the visible range. Examples are crystals like bis- 
muth germanate or lead tungstate, lead glass and doped plastics, but also 
liquids like liquid noble gases. The light is collected and measured by a pho- 
todetector. Electrons liberated by photoelectric effect are multiplied by an 
avalanche in an electrical field; arrival time and signal height are measured. 
A layered construction, with passive metallic absorbers and active counting 
layers, allows absorption of high energy particles. In both homogeneous and
layered calorimeters one uses the fact that high energy particles cause showers
in matter¹², such that their energy is distributed over many particles and it
suffices to count those to arrive at an energy measurement. 

In addition to detectors which localise particles, determine their time 
of arrival and measure momentum or energy, there are many others which 
help to identify them. Examples are Cherenkov counters, transition radiation
detectors and time-of-flight systems [503]. They are all sensitive to particle
mass in one way or another. It is also common to all modern particle
detectors that they generate electrical signals which can be digitised, anal- 
ysed by computers and stored. In this way, synergy and redundancy between 
different detector types is used to optimise the reliability and accuracy of 
the measurement. This is of increasing importance, since modern particle 
physics experiments are often only repeatable in principle, due to the large 
effort necessary to construct accelerators and detectors. Prominent examples
are the multipurpose detector systems ATLAS (https://atlas.cern/)
(see Figure 9.14) and CMS (https://cms.cern/) at the Large Hadron Col- 
lider of CERN. The same is also true for modern particle experiments which 
do not use accelerators. Examples are the spectrometer for cosmic rays 
AMS (http://www.ams02.org/) (see Focus Box 10.3) on the International 


12 See e.g. https://en.wikipedia.org/wiki/Particle_shower




Modern ionisation detectors are often based on semiconductor sensors to
localise particles and measure their specific energy loss dE/dx. A particle
traversing a semiconducting solid, such as a silicon diode, will create electron-hole
pairs. Electrons can be lifted from the valence band into the conduction
band by relatively little energy: typically only 3 eV are required, compared to
about 30 eV to ionise a gas molecule. This is due to the narrow gap between
both bands and the fact that the Fermi energy, up to which the energy levels
are completely filled, is situated right between the two. The probability
density to find an electron at a given energy E at temperature T in a crystal
is:

\[ f(E) = \frac{1}{e^{(E - E_F)/kT} + 1} \]

with the Boltzmann constant k (see Section 5.1). The Fermi level $E_F$ is
defined as the energy where $f(E_F) = 1/2$.
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The gap between valence and conduction band is typically 1.12 eV wide for 
silicon at 300 K. The basis of a semiconductor diode are two layers, one of 
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A depletion zone is formed around the contact, which works like a diode. 
When one applies an external voltage, a so-called reverse bias, the depletion 
zone can be enlarged. Electrons and holes liberated in this zone migrate to 
the two faces, within nanoseconds and very locally. Because of the low energy 
required to create mobile charges, the efficiency of such a detector is high, its 
response is linear in dE/dx and its energy resolution is very good. Only for very 
large ionisation densities, like those of heavy ions, saturation effects set in. To 
localise particles, the surfaces of semiconductor sensors are metallised with fine 
structures where charges are collected and fed into further electronic stages. 
Besides particle physics, semiconductor detectors are used in light detection, 
e.g. by CCD cameras. 
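
As a numerical illustration of the occupation probability f(E) given in this box, the following sketch evaluates it for silicon at 300 K and compares the number of charge carriers freed in silicon and in a gas for the same deposited energy. It assumes a Fermi level exactly in the middle of the 1.12 eV gap, energies measured from the valence band edge, and a hypothetical energy deposit of 30 keV; the function names are illustrative.

```python
import math

K_BOLTZMANN_EV_PER_K = 8.617333262e-5  # Boltzmann constant in eV/K


def fermi_dirac(energy_ev, fermi_level_ev, temperature_k):
    """Occupation probability f(E) = 1 / (exp((E - E_F) / kT) + 1)."""
    x = (energy_ev - fermi_level_ev) / (K_BOLTZMANN_EV_PER_K * temperature_k)
    return 1.0 / (math.exp(x) + 1.0)


def n_charge_pairs(deposited_energy_ev, energy_per_pair_ev):
    """Mean number of charge pairs created by a given energy deposit."""
    return deposited_energy_ev / energy_per_pair_ev


gap_ev = 1.12                  # silicon band gap at 300 K, as quoted in the text
fermi_level_ev = gap_ev / 2.0  # assumption: Fermi level in the middle of the gap

# By definition, the occupation probability at the Fermi level is 1/2.
print(fermi_dirac(fermi_level_ev, fermi_level_ev, 300.0))

# At the conduction band edge the probability is tiny at room temperature.
print(fermi_dirac(gap_ev, fermi_level_ev, 300.0))

# Hypothetical 30 keV deposit: about 3 eV per pair in silicon, about 30 eV in a gas.
deposit_ev = 30_000.0
print(n_charge_pairs(deposit_ev, 3.0), n_charge_pairs(deposit_ev, 30.0))
```

The tenfold larger number of charge carriers per unit of deposited energy is what reduces the relative statistical fluctuation of the signal and thus underlies the good energy resolution mentioned above.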


Focus Box 8.7: Solid state particle detectors 
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Space Station, and the neutrino observatory Ice Cube at the South Pole 
(https://icecube.wisc.edu/).

The electronic signals from modern detectors are digitised and stored. They 
are first used online to decide whether an interesting reaction has occurred. If 
so, the data are kept for further processing. Data from different parts of the 
detector, but relevant to the same final state particle, are then coordinated 
and used to reconstruct the final state of the reaction with as much detail 
and accuracy as possible. These data can then be compared to theory pre- 
dictions. For this purpose, reactions and their final states are generated in a 
computer program following the prescriptions of a given theory. The response
of the detector to the final state particles is then simulated. This numerical
procedure, called Monte Carlo simulation, was pioneered during the Manhattan
Project (see Section 6.4). Reaction cross sections and distributions of final
state properties can thus be statistically compared between experiment and 
theory. 
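
The chain described above (generate events according to a theory, simulate the detector response, compare with data) can be sketched in a few lines. Everything specific in this sketch is an assumption made for illustration: the "theory" is a simple exponential energy spectrum, the detector is reduced to a Gaussian energy smearing with a trigger threshold, and the "data" are just a second, independently simulated sample.

```python
import random


def generate_event(rng, mean_energy=50.0):
    """Draw one 'true' final-state energy (GeV) from the assumed theory,
    here simply an exponential spectrum with the given mean."""
    return rng.expovariate(1.0 / mean_energy)


def simulate_detector(rng, true_energy, resolution=0.10, threshold=5.0):
    """Smear the true energy with a Gaussian of relative width `resolution`;
    return None if the measured energy is below the trigger threshold."""
    measured = rng.gauss(true_energy, resolution * true_energy)
    return measured if measured >= threshold else None


def simulate_sample(n_events, seed):
    """Generate n_events and keep the measured energies of accepted ones."""
    rng = random.Random(seed)
    accepted = []
    for _ in range(n_events):
        measured = simulate_detector(rng, generate_event(rng))
        if measured is not None:
            accepted.append(measured)
    return accepted


def histogram(values, edges):
    """Count values in the bins [edges[i], edges[i+1])."""
    counts = [0] * (len(edges) - 1)
    for v in values:
        for i in range(len(counts)):
            if edges[i] <= v < edges[i + 1]:
                counts[i] += 1
                break
    return counts


def chi_square(observed, expected):
    """Chi-square between two histograms of independent samples of equal
    size; the variance of each bin difference is approximated by o + e."""
    return sum((o - e) ** 2 / (o + e)
               for o, e in zip(observed, expected) if o + e > 0)


edges = [5.0, 25.0, 50.0, 100.0, 200.0]
simulated = histogram(simulate_sample(20000, seed=1), edges)
pseudo_data = histogram(simulate_sample(20000, seed=2), edges)
print(simulated, pseudo_data, chi_square(pseudo_data, simulated))
```

In a real experiment, the second sample would be replaced by measured data, and a chi-square much larger than the number of bins would signal a disagreement between theory and experiment.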

The high luminosities of modern colliders lead to substantial reaction 
rates even for processes with tiny cross sections. Systems of particle detec- 
tors deliver extremely detailed information about every single event. Particle 
physics experiments thus need to exploit every advance in computing, data storage
and analysis technology. From Rossi’s coincidence circuit to neural networks
in modern experiments, experimental particle physics has accompanied and
often enough driven this progress. An excellent example is the invention of the
World Wide Web by Tim Berners-Lee at CERN in 1989¹³. Originally meant
to organise and access data and documentation of physics experiments, it
has caused a revolution in the way everybody communicates and accesses
knowledge.


8.4 FAILED PROJECTS 


From the above you may get the impression that the history of particle physics 
technology consists of an uninterrupted sequence of breakthroughs and shining 
successes. While this is true in general, history is also marked by important 
set-backs, which have taught us how to live with failures and learn from them. 

There are major accelerator projects which have never been brought to a
successful completion. A first prominent example is the ISABELLE collider
project at the Brookhaven National Laboratory. It was supposed to be an 
intersecting storage ring, colliding 400 GeV on 400 GeV protons, succeeding 
the pioneering and successful CERN ISR [582], which had 31+31 GeV centre- 
of-mass energy and ran from 1971 to 1984. 

The idea of high energy proton colliders was based on original work by 
Gerard K. O’Neill in 1956 [337], who proposed two separate storage rings side- 
by-side with a single interaction point in common, like the Princeton-Stanford 
electron collider [385, 408] (see Focus Box 8.3). What was proposed instead at 


13 For a copy of his original proposal see http://info.cern.ch/Proposal.html




Brookhaven in 1973 were two storage rings on top of each other with multiple 
intersections, in the spirit of the CERN design, but with superconducting 
magnets [414]. 

A year later, the U.S. High Energy Physics Advisory Panel recommended 
that ISABELLE should be built. Construction began in 1978, with a prototype 
magnet successfully tested in parallel. Series production of these magnets, 
however, faced serious quenching problems, and caused considerable delays. 
Meanwhile, the construction of the Spp̄S collider at CERN and the subsequent
discovery of the W and Z bosons (see Section 9.2) preempted the original 
physics justification of the collider. 

In addition, starting in 1976 a proposal for a much larger machine, even- 
tually called the Superconducting Super Collider, a proton-proton system to 
collide 20 TeV protons on 20 TeV protons, was discussed. On July 13, 1983, 
the U.S. Department of Energy cancelled the ISABELLE project after spend- 
ing more than 200 million U.S. dollars on it. However, not all was lost. Parts of 
the tunnel, experimental hall and magnet infrastructure built for ISABELLE 
were finally reused to construct the Relativistic Heavy Ion Collider (RHIC), 
which began operation in 2000¹⁴.

A far bigger blow to U.S. particle physics was the failure of the subse- 
quent Superconducting Super Collider (SSC) project. In the beginning of the 
1980s a plan was put forward, with prominent supporters like Fermi National 
Laboratory director Leon Lederman and theorist Steven Weinberg as well as 
a majority of the U.S. particle physics community. In the first half of 1984 
an ad hoc group prepared the Reference Designs Study [463], in which three 
possible designs for the SSC were developed and compared. At that same 
time a Central Design Group was created and Maurice Tigner of Cornell was 
appointed as its director. In October 1984 the CDG officially began its work 
of coordination and guidance for a national research and development effort 
at Brookhaven National Laboratory, Fermi National Laboratory, Lawrence 
Berkeley Laboratory, the Texas Accelerator Center, and participating uni- 
versities. The group delivered a Conceptual Design Report in March 1986, 
which was reviewed by the DoE. The project gained presidential approval 
by the Reagan administration in January 1987, in “a deliberate attempt to 
reestablish U.S. leadership in a scientific discipline the nation had long dom- 
inated. If other nations were to become involved, they would have had to 
do so as junior partners in a multibillion-dollar enterprise led by U.S. physi- 
cists” [638]. Subsequently, candidate sites were invited to make proposals. 
The DoE received 43 proposals by September 1987, of which 36 were initially 
retained as qualified¹⁵. A committee narrowed down the choice to a short list
of seven sites. Among them was the established Fermi National Laboratory 
close to Chicago, but also sites without major research infrastructure. One of 
the latter was selected. The site, 2600 hectares of fields and pastures outside


14 See https://en.wikipedia.org/wiki/Relativistic_Heavy_Ion_Collider
15 See https://lss.fnal.gov/archive/other/ssc/ssc-doe-eis-0138-22.pdf
of Waxahachie in Ellis County, Texas, was announced a day after the Texan
George Bush Sr. was elected president in 1988. The announcement had been 
delayed to “avoid the appearance of backroom favouritism” [465]. Congress in 
1989 voted 331 to 92 to proceed with the construction. 

During the design and the first construction stage, a heated debate ensued 
about the high cost of the project. In 1987, Congress had been told the project 
could be completed for 4.4 billion U.S. dollars, an adventurous estimate considering
that there was neither existing infrastructure nor a strong national laboratory
on site. In 1993, the NGO Project on Government Oversight,
in an open letter to President Clinton, released a draft audit report by the
Department of Energy’s Inspector General heavily criticising the project for 
its high costs and poor management [472]. By then, the cost estimate had 
evolved to almost three times the initial estimate. 

As far as politicians were concerned, cost overruns and poor management 
were the official reasons for cancelling the project. But in hindsight, many reasons
have been identified for its failure. The analysis which I, as a European
particle physicist, find most convincing has been summarised by Michael Riordan
[624, 638]. He argues that the daring decision by a single nation, however
powerful, to build a collider twenty times as large and twenty times as costly
as the existing Tevatron at the Fermi National Laboratory, and to do so in the
Texan wilderness, was just “a bridge too far.” He argues that a more incremental
approach, like the one chosen by CERN in Europe (see Figure 8.5), relying
on an existing national centre like Fermilab with existing infrastructure
and expert manpower, would have had more chance of success. According to
Riordan, a multilateral approach, inviting other nations to make major contributions
and giving them a say in shaping the project, would have protected
the project better from U.S. partisan politics. However, the recent unilateral
withdrawal of the U.S. from major international initiatives casts some doubt on
this assumption. In addition to a slowing economy approaching a recession,
Riordan identifies a clash of culture between physicists and managers from
the military-industrial complex as a major problem, as well as the reluctance
of high level physicists and engineers to venture a long-term engagement in an
unattractive area.

The termination of the SSC came as a deep shock to the U.S. particle physics
community. As an immediate reaction, Steven Weinberg [474, p. 277ff] feared
that with this cancellation, the U.S. seemed “permanently to be saying good-
bye to any hope of ever having a responsible program of research in elementary
particle physics.” Fortunately, this harsh judgement turned out to be overly
pessimistic. In the aftermath, U.S. particle physicists defined a vigorous program
of neutrino, heavy flavour and heavy ion physics at accelerators, as well
as a rich non-accelerator particle physics agenda. Moreover, U.S. physicists
participate prominently in experiments at the European LHC.

The cancellation of the SSC indeed gave a push to CERN’s less ambitious
LHC project, which had been selected unanimously in 1991 by its governing
body, the CERN Council, in which the 23 member states have equal votes.
The full scientific, technical and financial proposal was presented by the designated
CERN director general Chris Llewellyn-Smith and approved in December
1993. In addition to its less ambitious goal of 7+7 TeV centre-of-mass
energy, LHC’s advantage in terms of cost was the use of the pre-existing engineering
infrastructure and the 27 km long underground tunnel of the Large
Electron-Positron Collider. The construction was completed in 2008 under the
leadership of its Welsh project director, Lyn Evans [577]. The LHC cost 4.6
billion Swiss Francs to build, equivalent to about the same sum in U.S. dollars.
It produced first beams in August 2008. A month later, a faulty connection 
in one of its superconducting magnets caused a damaging quench, explosively 
releasing helium from the magnet cold mass into the tunnel. Nobody was hurt, 
but the real start-up was delayed by about a year to prevent similar incidents. 
The LHC has been running according to schedule ever since, including regular 
upgrades. 

In contrast to abandoned accelerator projects, failures of particle physics 
experiments are less visible. To my knowledge, none associated with large 
accelerator programs have failed because of technical or financial problems. 
This may well be because when an experiment fails, the participating scientists 
are basically the only victims. This is a fabulous incentive to overcome technological
or financial crises. Also, regular peer review prevents major problems
from passing under the radar.

Non-accelerator projects are more vulnerable. An example of a major inci- 
dent is the 2001 accident at the Super-Kamiokande neutrino observatory in 
Japan. After maintenance work, for which the 50,000 ton water tank of this 
giant Cherenkov detector had been emptied, one of the 11,200 photomultiplier 
tubes imploded, sending out a shock wave that destroyed half of the tubes¹⁶.
Almost immediately, the director of the Kamioka laboratory, where the detec- 
tor is housed, reassured the scientific community [526]: “We will rebuild the 
detector. There is no question.” Indeed it restarted physics a year later with 
half the density of tubes, and in 2006 with a fully equipped detector. 

Large technological and operational risks are of course encountered by balloon
and space experiments [612]. During launch, mechanical stresses occur
which sensitive particle detectors do not always withstand. In the atmosphere,
weather is unpredictable and so are balloon trajectories. In space,
temperature excursions are very large and the radiation environment is harsh.
Thus numerous probes and satellites have been lost or failed to deliver the
expected data¹⁷. Repairs in orbit, as done e.g. for the Hubble Space Telescope
(https://hubblesite.org) and the AMS cosmic ray observatory on the ISS
(see Focus Box 10.3), are a very rare exception.

What is far more interesting is the case of experiments or whole pro- 
grams which technically worked, but did not have the expected outcome. It is 


16 For a complete account of the accident, see http://www-sk.icrr.u-tokyo.ac.jp/cause-committee/1st/report-nov22e.pdf

17 For a list see e.g. https://en.wikipedia.org/wiki/Timeline_of_artificial_satellites_and_space_probes
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Figure 8.8 Mismatch between original purpose and actual discoveries made 
with major particle physics infrastructure, as seen by Nobel laureate Samuel 
C.C. Ting. (Credit: S.C.C. Ting) 


after all the purpose of scientific activity to probe the unknown and discover 
what nobody had thought of before. Nobel laureate Samuel C.C. Ting always 
makes this point, as shown in Figure 8.8. Thus the actual outcome of
experiments rarely figures in the original proposal. These unexpected outcomes of
experimental programs are of course not failures but the very definition of 
successful experimentation. 
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CHAPTER 9 


The Standard Model of 
matter and forces 


The Standard Model is expressed in equations governing the vari- 
ous fields, but cannot be deduced from mathematics alone. Nor does it 
follow straightforwardly from observation of nature... Nor can the Stan- 
dard Model be deduced from philosophical preconceptions. Rather, the 
Standard Model is a product of guesswork, guided by aesthetic judg- 
ment, and validated by the success of many of its predictions. Though 
the Standard Model has many unexplained aspects, we expect that at 
least some of these features will be explained by whatever deeper theory
succeeds it.
S. Weinberg, To Explain the World, 2015 [628]


In this chapter I describe the Standard Model of matter, electroweak and
strong interactions and its vacuum. This is the pièce de résistance of the
book.


9.1 MATTER 


At the beginning of the 1930s, electron and positron were the only known leptons.
In the years 1936/37, Anderson and Neddermeyer installed their spectrometer
on Pike’s Peak, at an altitude of 4300 m one of the highest summits
in the Rocky Mountains. There they identified a charged particle with a mass 
between that of the electron and that of the proton and suggestively called 
it “mesotron” [269, 273]. Shortly after, Street and Stevenson [277] confirmed 
their discovery of the new particle. It was wrongly conjectured at the time to 
be the force particle that should transmit the nuclear force according to Hideki 
Yukawa’s trial field theory [268]. Consequently it was wrongly categorised as a 
meson [446]. In the group of Marcello Conversi in Rome, as well as by Evan J. 
Williams and George E. Roberts at University College of Wales, the decay of 
the new particle was first observed [490]. Under the most difficult conditions 
of World War II, Conversi, Oreste Piccioni and Ettore Pancini determined 






its lifetime of about 2 µs. However, their findings were not generally known
until after the war [303, 309]. After Enrico Fermi’s emigration to the U.S., 
Franco Rasetti, a close collaborator of his, went to Canada in 1938. He mea- 
sured, in parallel to Conversi’s group, the lifetime of “mesotrons” stopped in 
matter [294, 592]. After the war, Conversi and his colleagues showed that the 
muon had nothing to do with nuclear forces [309]. It was thus recognised as a 
heavy “brother” of the electron, and the name mesotron disappeared in favour of
the less suggestive muon (μ). However, it is not at all an excited state of the
electron, as the absence of the radiative decay $\mu^- \to e^- \gamma$ shows.

The neutral leptons, i.e. neutrinos (ν), are much harder to detect, since
they only interact via weak interactions (see Section 9.2 below). The existence
of such a rarely interacting particle had been boldly conjectured by Wolfgang
Pauli in 1930, based on the requirement of energy and angular momentum
conservation in beta-decays of nuclei with mass number A and charge Z,
$(A, Z) \to (A, Z+1)\, e^- \bar{\nu}_e$. He had called it neutron; the real neutron had not
yet been discovered (see Section 5.5). In his famous letter to the “dear radioactive
ladies and gentlemen”¹ he had also predicted that the particle would be
hard to find. Indeed the cross sections of neutrinos with matter are the smallest 
of all elementary particles, such that a neutrino can easily traverse Earth with- 
out interacting. It is thus no wonder that their discovery had to wait for high 
neutrino fluxes and massive detectors to become available. The first idea came 
from nuclear bomb research at Los Alamos [492] (see Section 6.4). Frederick 
Reines and Clyde Cowan had realised that nuclear beta decay can be reversed
into the reaction $\bar{\nu}_e\, (A, Z) \to (A, Z-1)\, e^+$, leading to a detectable positron.
When the reaction takes place on a hydrogen nucleus, $\bar{\nu}_e\, p \to n\, e^+$, also the
final state neutron can be used. Their first plan sounds crazy to us today: they
considered using neutrinos released in the test of a nuclear bomb. In the end,
and more realistically, the nuclear reactors at Hanford and Savannah River
were used as neutrino sources. The experiment required massive detectors,
capable of detecting electron-positron annihilation into photons, $e^+ e^- \to \gamma\gamma$,
as well as neutron capture in a nucleus with subsequent photon emission. This
became possible with the development of liquid scintillators, organic materials 
in solution which react to passing ionising particles by emitting short pulses 
of light. They have a high hydrogen content and were originally also devel- 
oped for military purposes. Counting neutrino reactions against the intense 
background coming from the nuclear reactor and cosmic rays was difficult. 
It succeeded because the signal from neutron capture is delayed by a few 
microseconds from the positron signal. In addition, random coincidences with 
an even longer delay give a direct measure of the background. This procedure, 
called side-band subtraction, is still routinely used today to identify small sig- 
nals in a large background. In June 1956, Reines and Cowan communicated 


1 A copy of the letter in German can be found at cds.cern.ch/record/83282/files/meitner_0393.pdf
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the evidence for neutrino reactions in a telegram to Wolfgang Pauli. In July 
a short publication followed [332]. 
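
The delayed-coincidence counting with side-band subtraction described above can be sketched as follows. The sketch assumes that random coincidences are uniformly distributed in delay, so that the count in a late "side band" can be scaled to the width of the signal window. The window boundaries and the toy delays are invented for illustration and are not those of the original experiment.

```python
import math


def side_band_subtract(delays_us, signal_window=(1.0, 11.0), side_band=(50.0, 100.0)):
    """Estimate the number of true delayed coincidences.

    delays_us: delays between the prompt and the delayed pulse, in microseconds.
    The side band is assumed to contain only random coincidences, and these
    are assumed to be uniformly distributed in delay.
    Returns (signal estimate, background estimate, statistical error).
    """
    n_in_signal_window = sum(signal_window[0] <= d < signal_window[1] for d in delays_us)
    n_in_side_band = sum(side_band[0] <= d < side_band[1] for d in delays_us)
    # Scale the side-band count to the width of the signal window.
    scale = (signal_window[1] - signal_window[0]) / (side_band[1] - side_band[0])
    background = scale * n_in_side_band
    signal = n_in_signal_window - background
    # Poisson errors on both counts, propagated to the difference.
    error = math.sqrt(n_in_signal_window + scale ** 2 * n_in_side_band)
    return signal, background, error


# Hand-made toy delays (invented numbers): four in the signal window, five in the side band.
toy_delays = [2.0, 3.5, 5.0, 8.0, 60.0, 70.0, 80.0, 90.0, 95.0]
signal, background, error = side_band_subtract(toy_delays)
print(f"signal = {signal:.1f} +- {error:.1f}, background = {background:.1f}")
```

A wider side band reduces the statistical error of the background estimate, since the scaled side-band count enters the error with the square of the scale factor.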

Muon neutrinos and antineutrinos, neutral leptons of the second generation,
are produced in the decay chain of charged pions: $\pi^- \to \mu^- \bar{\nu}_\mu$;
$\mu^- \to e^- \bar{\nu}_e \nu_\mu$ (resp. $\pi^+ \to \mu^+ \nu_\mu$; $\mu^+ \to e^+ \nu_e \bar{\nu}_\mu$). Similar processes happen
in the decays of charged kaons. The search for their interactions required
powerful proton accelerators with the necessary intensity to create massive
amounts of pions and kaons. The first available was the Brookhaven AGS,
where a focused neutrino beam was first constructed in the early 1960s.
Since neutrinos are electrically neutral, they cannot themselves be focused
to increase the flux. Instead, following ideas by Bruno Pontecorvo [344] and
Mel Schwartz [348], one can steer the proton beam towards a beryllium target
and focus the pions and kaons, which are produced in large numbers. One
then lets them decay. A thick metal absorber stops all leftover particles but
the barely interacting neutrinos. In Brookhaven, steel plates from a scrapped
military vessel with a total thickness of 13.5 m were used. A massive detector
(in Brookhaven a sandwich of aluminium plates, spark chambers and scintillators
with a total mass of 100 tons) allowed detecting interactions of muon
neutrinos with nucleons, $\nu_\mu N \to \mu^- X$, where N denotes a proton or neutron
and X an arbitrary hadronic final state. Details of the hadronic final state
in fact do not matter; detecting the rather penetrating muon suffices to show
that the reaction is not due to electron neutrinos or hadrons. In summer of
1962, Leon Lederman, Schwartz and Jack Steinberger published first results
on the discovery of muon neutrino interactions [357]. The three were awarded
the Nobel prize in physics of 1988.

Bruno Pontecorvo, one of the founding fathers of neutrino physics, was a fascinating
physicist and a somewhat mysterious person [616]. Having worked for
the British Atomic Energy Research Establishment in Oxfordshire, he defected 
to the Soviet Union in 1950 and disappeared from public view for over five 
months. There have been speculations ever since about him having been a Soviet 
spy (see Section 6.5), an accusation he always denied. He did most of his best 
work while in Dubna, at the Joint Institute for Nuclear Research. He died of 
Parkinson’s disease in 1993, shortly after the end of the Cold War. 

The high mass resolution of bubble chambers and the increasing energy
reach of accelerators led to an explosion in the number of newly detected
hadrons and resonances² in the 1950s and 1960s. Putting order into the ever
richer spectrum of particles was badly needed. Starting in 1957, Arthur
H. Rosenfeld³, Walter H. Barkas, Murray Gell-Mann [339, 371] and Matts


2 A resonance is a short lived hadronic state, seen as a broad peak in the mass distribution 
of its decay products. Other methods to detect resonances are the Dalitz plot and the Argand 
diagram. 

3 A first edition of the Rosenfeld table was a report from the University of California
Radiation Laboratory, W.H. Barkas and A.H. Rosenfeld, Data for Elementary Particle
Physics, UCRL-8030, which was updated several times and published in Reviews of Modern
Physics from 1964 onward.
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Roos [368, 382] did pioneering work in this field editing the so-called Rosen- 
feld tables. They listed the measured properties of all known particles, with 
experimental details and systematic averaging wherever more than one mea- 
surement was available. Rosenfeld was Enrico Fermi’s last PhD student. Roos, 
a theoretician mastering statistical methods, played an important role in developing
well founded methods of kinematic and statistical analysis⁴ in particle
physics. The Rosenfeld tables developed into the Review of Particle Properties
(RPP), which was published for the first time in 1968. Brought together by
an international network, the Particle Data Group (http://pdg.lbl.gov/),
the Review has grown today to a volume of more than 2000 pages [658]. It
is not only the generally accepted reference for carefully and critically edited
results concerning particle properties, but also a comprehensive reference for
the methodology of particle physics. Since 1995, it has thus rightly been called the
Review of Particle Physics. Its pocket edition, which started as a wallet card
in 1957 and is now the Particle Physics Booklet of about 300 pages, has been
the daily companion of several generations of particle physicists.
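
The systematic averaging of several measurements of the same quantity, mentioned above, can be illustrated by an inverse-variance weighted mean. The sketch below also inflates the error by a scale factor when the measurements scatter more than their quoted errors suggest. This is similar in spirit to the procedure commonly associated with the Particle Data Group, but it should be read as an assumption of this sketch rather than a description of the actual Review; the input numbers are invented.

```python
import math


def weighted_average(measurements):
    """Combine independent measurements given as (value, error) pairs.

    Uses inverse-variance weights. If the measurements scatter more than
    their errors suggest (chi2 per degree of freedom above 1), the error of
    the average is inflated by the scale factor S = sqrt(chi2 / (N - 1)).
    Returns (average, error, scale factor).
    """
    weights = [1.0 / err ** 2 for _, err in measurements]
    total_weight = sum(weights)
    average = sum(w * val for w, (val, _) in zip(weights, measurements)) / total_weight
    error = 1.0 / math.sqrt(total_weight)
    scale = 1.0
    if len(measurements) > 1:
        chi2 = sum(w * (val - average) ** 2 for w, (val, _) in zip(weights, measurements))
        scale = max(1.0, math.sqrt(chi2 / (len(measurements) - 1)))
    return average, error * scale, scale


# Two invented measurements of the same quantity that disagree by two standard deviations.
average, error, scale = weighted_average([(10.0, 1.0), (12.0, 1.0)])
print(f"average = {average:.2f} +- {error:.2f} (scale factor {scale:.2f})")
```

Measurements with small errors dominate the average, and a scale factor above 1 flags a set of results that are mutually less consistent than their errors claim.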
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Figure 9.1 Left: Octet of baryons with spin 1/2 (left) and decuplet of baryons 
with spin 3/2 (right). The particles are ordered according to their strong 
isospin (horizontal axis) and their strangeness (vertical axis). The discovery 
of the Ω⁻ was a triumph of the Eightfold Way.


An organising pattern in the properties of mesons and baryons was found 
in group theory. Groups are formed by transformations, if their sequential 
application always leads to another member of the group. Lie groups are 
those where transformations in an infinitesimal neighbourhood of the identity
transformation (the one which changes nothing) define the properties of the
whole group. Translations and rotations in three-dimensional space are com- 


4 See the standard reference book by W.T. Eadie, D. Drijard, F.E. James, M. Roos and B.
Sadoulet, Statistical Methods in Experimental Physics, North Holland, Amsterdam, 1971, 
current edition by Frederick James [563]. 
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mon examples of Lie groups, for which all transformations can be decomposed 
into a series of infinitesimal ones. Hadrons with the same spin and similar 
masses can be classified according to their quantum numbers strangeness and 
strong isospin⁵ [354, 358, 365, 366]. The former distinguishes strange hadrons,
like Λ or K, from their non-strange siblings, like n and π. The latter has mathematical
properties analogous to spin; it is a two-component vector defined
by its length I and its third component I₃. It distinguishes e.g. the proton
with I₃ = +1/2 from the neutron with I₃ = −1/2. Transformations which
form the group SU(3) relate members of a so-called representation of the
group; their properties are similar. The lightest baryons with spin 1/2 and
with or without strangeness belong to an octet or singlet representation of
SU(3), those with spin 3/2 to a decuplet, as shown in Figure 9.1. Inspired by
Buddhist ethics, Murray Gell-Mann baptised this order scheme the “Eightfold
Way” [351], based on the irreducible representations of SU(3)⁶.
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Figure 9.2 Octet of light mesons. 


The triply strange baryon Ω⁻ was still missing when the order scheme
was proposed; it was discovered in 1964 at the 30 GeV proton synchrotron
of the Brookhaven National Laboratory, again with a hydrogen-filled bubble
chamber [372]. Herwig Schopper, who was CERN Director General
from 1981 to 1988, wrote about this discovery [376]: “This time it is not just
any new particle, but the discovery of the Ω⁻ is an experimentum crucis for
a new theory, which could become for the systematics of elementary particles 
what the Balmer formula was for the structure of atoms.” There is of course 
also an analogy to the prediction and discovery of chemical elements based on 
Mendeleev’s periodic system. 

Strong interactions, which hold together hadrons, are insensitive to isospin 
and strangeness, such that masses in a multiplet are similar. They would 
even be equal, if the strong SU(3) symmetry were exact. This suggests the 


5 This strong isospin is not to be confused with the weak isospin we will introduce in
Section 9.2.

6 Irreducible representations are those which cannot be decomposed into less populated
ones. Examples are 3 × 3 × 3 = 10 + 8 + 8 + 1 (Fig. 9.1) and 3 × 3̄ = 8 + 1 (Fig. 9.2).
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hypothesis that the observed symmetry properties are due to a substruc- 
ture of light mesons and baryons. This is the basis of the quark model, 
simultaneously invented by Murray Gell-Mann [373] and the CERN theorist 
George Zweig [384] in 1964. The name quark is borrowed from the powerful 
creative language of James Joyce’s “Finnegans Wake”⁷. In its original form,
the model is based on a triplet of quarks, called up, u = (1,0,0), down,
d = (0,1,0) and strange, s = (0,0,1). The coordinates denote the position in
an abstract space of strong isospin and strangeness, which today count among
the more numerous flavour quantum numbers. The antiquarks ū, d̄ and s̄ have
the same quantum numbers, but with opposite sign. Rotations in this abstract space,
which do not change the number of quarks, belong to the group SU(3). 

The electric charge is +2/3 e for the up quark and −1/3 e for the two others; all
three are fermions with spin 1/2. With these settings, all baryons of Figure 9.1
can be composed of three quarks. As an example, the proton is a state consisting
of two up and one down quark, (uud), the neutron has one up and two
down quarks, (udd). The mesons of Figure 9.2 consist of a quark-antiquark
pair, like the π⁺ of an up and an anti-down quark, (ud̄), or the K⁺ of an up
and an anti-strange quark, (us̄).
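
The bookkeeping of the quark model is simple enough to be checked in a few lines. The sketch below adds up electric charge, third isospin component and strangeness for the hadrons named above, using the quark assignments of the model (the strange quark carries strangeness −1 by convention). The notation with a trailing '~' for antiquarks and the function name are inventions of this sketch.

```python
from fractions import Fraction
from itertools import combinations_with_replacement

# (electric charge in units of e, third isospin component I3, strangeness)
QUARKS = {
    "u": (Fraction(2, 3), Fraction(1, 2), 0),
    "d": (Fraction(-1, 3), Fraction(-1, 2), 0),
    "s": (Fraction(-1, 3), Fraction(0), -1),
}


def quantum_numbers(content):
    """Add up charge, I3 and strangeness of a hadron.

    content: iterable of quark labels; an antiquark is written with a
    trailing '~' (a notation invented for this sketch), e.g. ["u", "d~"].
    """
    charge, i3, strangeness = Fraction(0), Fraction(0), 0
    for label in content:
        sign = -1 if label.endswith("~") else 1
        q, i, s = QUARKS[label.rstrip("~")]
        charge += sign * q
        i3 += sign * i
        strangeness += sign * s
    return charge, i3, strangeness


hadrons = {
    "proton": ["u", "u", "d"],
    "neutron": ["u", "d", "d"],
    "pi+": ["u", "d~"],
    "K+": ["u", "s~"],
    "Omega-": ["s", "s", "s"],
}
for name, content in hadrons.items():
    charge, i3, strangeness = quantum_numbers(content)
    print(f"{name:8s} Q = {charge}, I3 = {i3}, S = {strangeness}")

# Flavour combinations of three quarks, ignoring their order: ten in total,
# matching the number of states in the spin-3/2 decuplet.
three_quark_states = list(combinations_with_replacement("uds", 3))
print(len(three_quark_states))
```

All hadrons built in this way carry integer charge although their constituents do not, and the ten order-independent combinations of three quark flavours match the ten members of the decuplet of Figure 9.1.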

While the quark model was an elegant order scheme for the hadrons known 
at the end of the 1960s, its interpretation as a model for the internal structure 
of hadrons was far from generally accepted. The scepticism was due to the 
fact that the dynamics binding quarks together into hadrons was unknown. 
The direct proof that quarks are real and not just a fiction of group theory, 
was assumed to be their observation as free particles. The signature is clear: 
they are distinguished from ordinary matter by their fractional electric charge. 
However, all attempts to find free quarks were fruitless; a concise overview of 
the methods and results of these searches is e.g. given in [550]. 

Instead the proof that hadrons really have a material substructure was 
left to scattering experiments à la Rutherford, but with electrons instead of 
α particles as probes. The reason is that the interactions of high energy electrons
with charged particles were well understood and calculable thanks to
Quantum Electrodynamics. Those of nuclei were not. In addition, point-like
particles like electrons can probe deeply into protons and neutrons, without 
being themselves subject to strong interactions. 

Experiments at the Stanford Linear Accelerator Center (SLAC) in Cali- 
fornia were of fundamental importance for this research into the structure of 
nucleons. In the early 1950s, Robert Hofstadter, then a young associate pro- 
fessor at Stanford University, initiated a program on electron scattering off 
heavy and light nuclei. He used electron beams of a few hundred MeV energy 
and a one-arm spectrometer rotatable around a target to identify and measure 
the outgoing electron. In 1954, first experiments with a hydrogen gas target 
were conducted [353]. 


⁷ The fourth chapter begins with a poem: “Three quarks for Muster Mark! / Sure he
hasn’t got much of a bark / And sure any he has it’s all beside the mark.” Thus quarks owe
their name to an alliteration.
We consider the elastic scattering of electrons off protons, $e^- p \to e^- p$. Let us
start with the simplest case, the elastic scattering of a relativistic electron off
a scalar, point-like target with unit charge $e$. In QED, the cross section for this
relativistic process is described by the so-called Mott cross section [232, 251]:

$$
\left(\frac{d\sigma}{d\Omega}\right)_{\mathrm{Mott}} = \frac{\alpha^2}{4E^2\sin^4\frac{\theta}{2}} \; \frac{E'}{E} \; \left[1 - \sin^2\frac{\theta}{2}\right]
$$

Here $E$ and $E'$ are the energies of the incoming and outgoing electron, $M$
the target mass and $\theta$ the scattering angle. The fine structure constant
$\alpha = e^2/(4\pi)$ determines the order of magnitude of the cross section. The first term
reminds us of the Rutherford cross section of Focus Box 4.5. Its denominator
results from the photon propagator, inversely proportional to the square of
the exchanged four-momentum $q$:

$$
\frac{1}{q^2} = \frac{1}{(k'-k)^2} \simeq -\frac{1}{2\,(EE' - \vec{k}\cdot\vec{k}')} \simeq -\frac{1}{4E^2\sin^2\frac{\theta}{2}}
$$

where $k$ and $k'$ are the four-momenta of incoming and outgoing electron. The
second term, $E'/E = 1/(1 + \frac{2E}{M}\sin^2\frac{\theta}{2})$, takes into account the recoil of
the target. The third term takes into account the electron spin, i.e. the
conservation of its helicity. Helicity is the projection of spin on the momentum vector
of a particle. It is positive for spin oriented in the direction of motion,
negative otherwise. Helicity conservation requires that the cross section vanishes
for backscattering, $\theta = \pi$. If the target is a fermion, with magnetic moment
$\mu = e/2M$, a magnetic term is added [316], which only becomes important in
the relativistic limit:

$$
\frac{d\sigma}{d\Omega} = \frac{\alpha^2}{4E^2\sin^4\frac{\theta}{2}} \; \frac{E'}{E} \; \left[\cos^2\frac{\theta}{2} - \frac{q^2}{2M^2}\sin^2\frac{\theta}{2}\right]
$$

The three terms thus describe a static and point-like target; target recoil; and
spin effects, respectively. The last term is especially important. It has two
concurrent angular dependences, one proportional to $\cos^2\theta/2$, the other to
$\sin^2\theta/2$. We will encounter these two terms again when the target has an internal
structure.


Focus Box 9.1: Electron scattering off a structureless target
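
To get a feeling for the three factors of the cross section in Focus Box 9.1, one can evaluate them numerically. The following is a minimal sketch under stated assumptions: natural units (energies in GeV, cross sections in GeV$^{-2}$ per steradian), a massless electron, and the standard value of the fine structure constant. The function names are chosen for this illustration only.

```python
import math

ALPHA = 1 / 137.036  # fine structure constant (dimensionless)

def recoil_factor(E, M, theta):
    """E'/E for elastic scattering off a target of mass M (energies in GeV)."""
    return 1.0 / (1.0 + 2.0 * E / M * math.sin(theta / 2) ** 2)

def mott_cross_section(E, M, theta):
    """Cross section for a scalar point-like target, in GeV^-2 per steradian."""
    s2 = math.sin(theta / 2) ** 2
    rutherford_like = ALPHA ** 2 / (4.0 * E ** 2 * s2 ** 2)
    helicity = 1.0 - s2  # equals cos^2(theta/2)
    return rutherford_like * recoil_factor(E, M, theta) * helicity

def point_fermion_cross_section(E, M, theta):
    """Same, with the magnetic term for a point-like spin-1/2 target."""
    s2 = math.sin(theta / 2) ** 2
    E_out = E * recoil_factor(E, M, theta)
    q2 = -4.0 * E * E_out * s2  # four-momentum transfer squared, negative
    prefactor = ALPHA ** 2 / (4.0 * E ** 2 * s2 ** 2) * E_out / E
    return prefactor * ((1.0 - s2) - q2 / (2.0 * M ** 2) * s2)

# Example: a 0.5 GeV electron on a target with the proton mass.
for degrees in (30, 90, 180):
    theta = math.radians(degrees)
    print(degrees,
          mott_cross_section(0.5, 0.938, theta),
          point_fermion_cross_section(0.5, 0.938, theta))
```

At 180 degrees the first function returns zero, as required by helicity conservation, while the magnetic term keeps the second one finite.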
Figure 9.3 The SLAC 8 GeV electron spectrometer [469]. The electron beam
enters from the left and hits a target of liquid hydrogen. The beam intensity 
is measured by an upstream toroid coil. The outgoing electron is identified 
and its momentum measured in the downstream spectrometer arm. This arm 
can be rotated around the target to cover different scattering angles. 


At small electron energies, less than about 1 GeV, and small momentum
transfers from electron to nucleon, the target particle stays intact and the
scattering is elastic. The scattering cross section then resembles the one for
Rutherford scattering, except that the recoil of the target and the spins of
both reaction partners have to be taken into account, as detailed in Focus
Box 9.1. The interaction takes place with the charge distribution of the target
as a whole. If the target is not point-like but extended, the elastic cross
section is diminished in a characteristic way, by what is called a form factor. As
explained in Focus Box 9.2, the form factor can be measured from the angular
distribution, and converted into a measure of the size of the charge
distribution. When the electron is relativistic, the magnetic moment of the target
also influences the cross section; there is then a second form factor related to the
distribution of the magnetic moment inside the nucleon. For the proton, both
correspond to a size of about $0.7 \times 10^{-15}$ m, close to 1 fm [331, 336]. Modern
precision measurements give a larger value of $0.831 \times 10^{-15}$ m [671]. For the
neutron, the magnetic form factor indicates a similar size for the distribution
of its magnetic moment. For this first look into the structure of the nucleon,
Hofstadter shared the 1961 Nobel prize in physics with Rudolf Mössbauer.

Alternatively, the proton radius can be determined from the atomic spectra 
of hydrogen, since ground state electrons traverse the nucleus all the time. For 
normal hydrogen, values different from those quoted above are obtained [652, 
669]. When the atomic electron is replaced by a muon, which supposedly gives 
more precise values, spectroscopy results agree with the scattering result [580].
New spectroscopy experiments are under way at Paul Scherrer Institute in 
Switzerland to resolve this discrepancy. 


Here we extend the discussion of Focus Box 9.1 to targets which are not
point-like, and to nucleons in general. If the target has a finite size, its charge is
distributed over space, such that the cross section is diminished by a so-called
form factor $|F(q^2)|^2$, a function of the four-momentum transfer $q$ between
projectile and target. In non-relativistic approximation, the form factor is the
Fourier transform of the spatial charge distribution $\rho(\vec{r})$:

$$
F(\vec{q}) = \int e^{i\vec{q}\cdot\vec{r}} \, \rho(\vec{r}) \, d^3r = 1 - \frac{1}{6}|\vec{q}|^2 \langle r^2 \rangle + \dots
$$

On the right is an approximation for small momentum transfers to a
spherically symmetric charge distribution with a mean quadratic extension $\langle r^2 \rangle$.
If the charge distribution is exponential, $\rho(r) \sim \exp(-\Lambda r)$, one finds a form
factor $F(|\vec{q}|) \sim (1 + |\vec{q}|^2/\Lambda^2)^{-2}$. For any shape, the form factor diminishes
rapidly with increasing momentum transfer.

Things are a bit more complex if we consider spin and a charge
distribution which is not static. Interactions are not limited to Coulomb scattering,
but magnetic effects come in. Of course, the role of “electric” and
“magnetic” terms depends on the reference frame. In the laboratory system, one
finds [316]:

$$
\frac{d\sigma}{d\Omega} = \frac{\alpha^2}{4E^2\sin^4\frac{\theta}{2}} \; \frac{E'}{E} \left[ \frac{G_E^2 + \tau G_M^2}{1+\tau} \cos^2\frac{\theta}{2} + 2\tau G_M^2 \sin^2\frac{\theta}{2} \right]
$$

with the parameter $\tau = -q^2/(4M^2)$, depending on the ratio of the
four-momentum transfer $q$ and the target mass $M$. One somewhat sloppily calls $G_E$
and $G_M$ the electric and magnetic form factors of the target. Experimentally,
the form $G \sim (1 + |\vec{q}|^2/\Lambda^2)^{-2}$ describes both form factors of the proton rather
well, with a common parameter $\Lambda \simeq 0.84$ GeV. For the neutron, the electric
form factor is compatible with zero, the magnetic one is roughly compatible
with the one of the proton, multiplied by the neutron magnetic moment.


Focus Box 9.2: Electron scattering off an extended target
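
The dipole parameter of Focus Box 9.2 can be translated into a size. The sketch below is an illustration under stated assumptions: it compares the small-momentum expansion of the dipole form, $1 - 2|\vec{q}|^2/\Lambda^2$, with the general expansion $1 - |\vec{q}|^2\langle r^2\rangle/6$, which gives $\langle r^2\rangle = 12/\Lambda^2$. The conversion constant $\hbar c \simeq 0.19733$ GeV fm and the function names are introduced here and are not taken from the text.

```python
import math

HBARC_GEV_FM = 0.19733  # hbar*c in GeV*fm, converts GeV^-1 to fm

def dipole_form_factor(q, lam=0.84):
    """Dipole form factor for momentum transfer |q| and scale lam, both in GeV."""
    return (1.0 + q ** 2 / lam ** 2) ** -2

def rms_radius_fm(lam=0.84):
    """Root mean square radius implied by the dipole scale.

    Expanding the dipole for small |q| gives 1 - 2 q^2 / lam^2; comparing with
    1 - q^2 <r^2> / 6 yields <r^2> = 12 / lam^2.
    """
    return math.sqrt(12.0) / lam * HBARC_GEV_FM

print(f"rms radius: {rms_radius_fm():.2f} fm")
```

With $\Lambda = 0.84$ GeV the result is about 0.8 fm, in the range of the proton sizes quoted in the main text.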


While elastic scattering leaves the target intact, its cross section falls off
quickly with increasing momentum transfer from the electron to the target.
At larger momentum transfer, inelastic scattering sets in. If the nucleon has
internal structure, a short-lived excited state can be produced. This happens
when the photon probing the nucleon has a wavelength comparable to the
nucleon size, i.e. at momentum transfers of the order of the nucleon mass,
roughly 1 GeV. Excited states of the nucleon are called resonances, in analogy
to other resonance phenomena in physics (see Focus Box 9.12). They occur
indeed when the photon resonates with an internal structure of the nucleon.
An example is the reaction $e^- p \to e^- \Delta^+$, followed by $\Delta^+ \to p\pi^0$ or
$\Delta^+ \to n\pi^+$. The resonance $\Delta^+$
itself is not observable due to its short lifetime, of the order of $10^{-24}$ s, but it
manifests itself in the invariant mass distribution of the final state particles.
This distribution is broad and centred on the mass of the resonance; its width is the
inverse lifetime, as explained in Focus Box 7.7.
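
The relation between width and lifetime, $\Gamma = \hbar/\tau$, is easy to evaluate. The following sketch is only an illustration: the value of $\hbar$ in MeV s is the standard one, while the width of about 117 MeV used for the $\Delta$ resonance is a commonly quoted number that does not appear in the text above and is therefore an assumption.

```python
HBAR_MEV_S = 6.582e-22  # reduced Planck constant in MeV*s

def width_from_lifetime(tau_s):
    """Resonance width in MeV for a lifetime in seconds (Gamma = hbar / tau)."""
    return HBAR_MEV_S / tau_s

def lifetime_from_width(gamma_mev):
    """Lifetime in seconds for a resonance width in MeV."""
    return HBAR_MEV_S / gamma_mev

# A lifetime of 1e-24 s corresponds to a width of several hundred MeV.
print(f"{width_from_lifetime(1e-24):.0f} MeV")
# Assumed input: a Delta width of about 117 MeV (not given in the text).
print(f"{lifetime_from_width(117.0):.1e} s")
```

Widths of the order of 100 MeV thus correspond to lifetimes of a few times $10^{-24}$ s, far too short to observe the resonance directly.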

When one increases the electron energy and the momentum transfer fur- 
ther, beyond a few GeV, the photon exchanged between projectile and target 
has a wavelength much smaller than the size of a nucleon. It can thus resolve 
its internal structure. If quarks really exist inside the nucleon, the photon 
can probe them. The nucleon is broken up and many additional mesons or 
even baryon-antibaryon pairs are produced. The invariant mass of the final
state is no longer a fixed quantity: it is neither the proton mass, as in elastic
scattering, nor the less precisely fixed resonance mass. It becomes an additional
kinematic variable only limited by energy-momentum conservation. A useful
choice of kinematic variables is the energy transfer from the electron to the
nucleon, thus the photon energy, and the square of the four-momentum
transfer, the squared photon mass. The latter is of course not zero since the photon
is virtual. The cross section can then be expressed in a very general way in
terms of two so-called structure functions, each of which is a function of both
variables, as explained in Focus Box 9.3.
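
The resolution argument can be made quantitative with the wavelength $\lambda = 2\pi\hbar c/|q|$ of the exchanged photon. The sketch below is an illustration only; the conversion constant $\hbar c \simeq 0.19733$ GeV fm and the function name are assumptions introduced here.

```python
import math

HBARC_GEV_FM = 0.19733  # hbar*c in GeV*fm

def photon_wavelength_fm(q_gev):
    """Wavelength 2*pi*hbar*c/|q| in fm for a momentum transfer |q| in GeV."""
    return 2.0 * math.pi * HBARC_GEV_FM / q_gev

for q in (0.2, 1.0, 5.0):
    print(f"|q| = {q} GeV -> wavelength {photon_wavelength_fm(q):.2f} fm")
```

At 1 GeV the wavelength is of the order of 1 fm, comparable to the nucleon size; at a few GeV it is several times smaller, so that structure inside the nucleon can be resolved.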

Starting in the late 1960s, the SLAC linear accelerator provided the intense 
and high energy electron beams required for experiments at energies of a 
few GeV, and opened the path to reaching the required resolution. A series 
of experiments to measure elastic and inelastic scattering off hydrogen and 
deuterium nuclei were conducted between 1968 and 1973 [469]. The experi- 
mental approach was in opposition to the doctrine of S-matrix theory, which 
stated that the dominant processes at low momentum transfer between pro- 
jectile and target contained all the physics of the process. Since the S-matrix 
approach denies any substructure of hadrons, no probe was supposed to pene- 
trate them anyway. In contradiction to this ideology, the experiments at SLAC 
clearly demonstrated that protons and neutrons indeed have an inner
structure. Jerome Friedman, Henry Kendall and Richard Taylor were awarded the
Nobel prize in physics for these seminal experiments in 1990. The experiments 
used a so-called one-arm spectrometer which only identified and measured the 
final state electron, as shown in Figure 9.3. The momentum vector of the initial 
state electron being known from the accelerator beam line, the final momen- 
tum vector suffices to calculate the kinematic properties of the hadronic final 
state, without actually measuring them. 
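
How the measurement of the electron alone fixes the hadronic final state can be sketched in a few lines. The code below is an illustration under stated assumptions: the electron mass is neglected, the nucleon is at rest, and the positive quantity $Q^2 = -q^2$ is used as a convention of this sketch. The function name and the beam settings in the example are hypothetical.

```python
import math

M_PROTON = 0.938  # proton mass in GeV

def inelastic_kinematics(E, E_out, theta, M=M_PROTON):
    """Kinematics of e N -> e X from the electron alone (energies in GeV).

    Returns the energy transfer nu, the positive quantity Q2 = -q^2, the
    invariant mass W of the hadronic final state and the ratio
    x = Q2 / (2 M nu).
    """
    nu = E - E_out
    Q2 = 4.0 * E * E_out * math.sin(theta / 2) ** 2
    W2 = M ** 2 + 2.0 * M * nu - Q2
    x = Q2 / (2.0 * M * nu)
    return nu, Q2, math.sqrt(W2), x

# Elastic check: with E' fixed by the recoil formula, W equals the proton mass.
E, theta = 5.0, math.radians(20.0)
E_elastic = E / (1.0 + 2.0 * E / M_PROTON * math.sin(theta / 2) ** 2)
print(inelastic_kinematics(E, E_elastic, theta))
# A lower outgoing energy at the same angle gives W > M and x < 1.
print(inelastic_kinematics(E, 2.0, theta))
```

The first call reproduces the elastic case, where the hadronic mass is the proton mass and the ratio $x$ equals one; any lower outgoing electron energy at the same angle corresponds to a heavier hadronic system.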

The kinematic region where the nucleon is broken up by a high energy photon
is also called deep inelastic scattering. Not knowing the internal structure of
the nucleon, one can parametrise the differential cross section with two terms,
as for elastic scattering (see Focus Box 9.2):

$$
\frac{d^2\sigma}{dE'\,d\Omega} = \frac{\alpha^2}{4E^2\sin^4\frac{\theta}{2}} \left[ W_2 \cos^2\frac{\theta}{2} + 2 W_1 \sin^2\frac{\theta}{2} \right]
$$

The two structure functions, $W_1$ and $W_2$, which take the role of the form
factors, parametrise the unknown distribution and dynamics of the nucleon
constituents, if it has any. They depend on the photon energy, $\nu = E - E'$,
and its squared invariant mass, $q^2$. The description requires two kinematic
variables, since the hadronic final state no longer has a fixed mass; its total
mass $M_{had}$ is related to these variables and the nucleon mass $M$ by
$M_{had}^2 = M^2 + 2M\nu + q^2$. The structure functions are measured by decomposing the
observed rate as a function of scattering angle $\theta$ and electron energy $E'$.

The great discovery at SLAC, in the late 1960s, was that starting at $Q^2 = -q^2$ of
a few GeV$^2$, the structure functions do not depend on $\nu$ and $Q^2$ separately,
but on their ratio $x = Q^2/(2M\nu)$, a dimensionless number between 0 and 1.
The phenomenon was named scaling at that time, based on the very general
observation that the variable $x_{Bj}$ does not depend on any scale, neither of mass
nor of energy. The structure functions show this behaviour, if the nucleon has a
substructure of charged particles inside, i.e. if quarks are really its constituents.
The photon can then interact elastically with those as sketched here:

[Sketch: the virtual photon is absorbed by a single quark carrying the fraction $x$ of the nucleon energy and momentum, $(xE, x\vec{p})$.]

This interpretation is valid when the binding force between quarks becomes
negligible with respect to the momentum transfer by the photon, and the
photon interacts with quarks incoherently. To obey kinematical constraints,
the photon must hit a quark which carries a fraction $p_q/p_N = x_{Bj}$ of the
energy-momentum of the nucleon. Consequently, the cross section must be
proportional to the probability to find such a quark. In the limit $Q^2 \to \infty$,
the structure functions then tend to be directly proportional to the quark
momentum distributions, $f_i(x) = dp_i/dx$, for quarks of type $i$ with charge $e_i$:

$$
\nu W_2(\nu, Q^2) \to F_2(x) = \sum_i e_i^2 \, x f_i(x) \quad ; \quad M W_1(\nu, Q^2) \to F_1(x) = \frac{1}{2x} F_2(x)
$$

Structure functions thus measure quark charges and momentum distributions.


Focus Box 9.3: Deep inelastic electron-nucleon scattering


The structure function measurements at SLAC established the following
picture of inelastic electron-nucleon scattering [468, 469]:


• At moderate momentum transfer the cross section has a series of broad
peaks, which correspond to short-lived resonances, excited states of the
nucleon.

• If one increases the momentum transfer further, thus reducing the
photon wavelength, the cross section not only becomes a smooth function,
it also only depends on a single kinematic variable instead of two. This
variable is called Bjorken-x, $x_{Bj}$, after one of its inventors [401], James
D. Bjorken, whom his friends and colleagues call BJ. It is a dimensionless
real number which can take values between 0 and 1.


A single kinematic variable immediately means that the scattering is elastic: 
the photon hits a constituent instead of the nucleon as a whole. The Bjorken 
variable has two roles. On the one hand, it describes the ratio between energy 
and mass of the exchanged photon. On the other, it denotes the fraction of the 
nucleon’s energy-momentum which the struck quark carries. This is due to the 
fact that energy-momentum can only be conserved if the two ratios agree. For 
Bjorken-x close to 1, the photon interacts with a quark which carries almost 
the full nucleon momentum. For values around $\frac{1}{3}$, it interacts with one out of
three quarks sharing it equally. 
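
That the two ratios must agree can be seen in a short calculation. It is sketched here under assumptions that are not spelled out in the text: the struck quark is treated as massless before and after the collision, it carries a fraction $\xi$ of the nucleon four-momentum $P$, and the nucleon is at rest. The symbols $\xi$, $P$ and $Q^2 = -q^2$ are notation introduced for this sketch.

```latex
\begin{aligned}
(\xi P + q)^2 &= \xi^2 M^2 + 2\,\xi\, P\cdot q + q^2 \simeq 0 , \\
P\cdot q &= M\nu \quad \text{(nucleon at rest)} , \\
\Rightarrow\quad \xi &\simeq \frac{-q^2}{2M\nu} = \frac{Q^2}{2M\nu} = x_{Bj}
\qquad (\xi^2 M^2 \ll Q^2) .
\end{aligned}
```

The momentum fraction of the quark is thus fixed by quantities measured on the electron side alone.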

The hypothesis behind this interpretation of structure functions is that 
quarks behave like free particles when they are probed at high momentum 
transfer. One calls this hypothesis asymptotic freedom. A theory has this property
if its coupling becomes small at small distances. We come back
to this important property of strong interactions in Section 9.2. In summary, 
the experimental findings based on nucleon structure functions are: 


• The three so-called valence quarks, $(uud)$ for the proton, $(udd)$ for
the neutron, together carry about half of the energy-momentum of the
nucleon. The rest must be carried by the particles which bind quarks
together. They are called gluons.


• The mean fraction of the proton energy-momentum carried by $u$ quarks
is $\bar{x}_u \simeq 0.36$, for $d$ quarks it is $\bar{x}_d \simeq 0.18$. In the neutron, the two values
are interchanged. This agrees with expectations from the quark model,
that quarks share their half of the nucleon energy-momentum roughly
equally. The $x$-distributions are very asymmetric with long tails towards
large values.


• Large separation between quarks, which would be necessary to observe
free quarks, cannot be realised, not even at extreme energy transfers
much larger than the nucleon mass. Quarks are always bound in hadrons.


This latter fact is complementary to asymptotic freedom, which says that at 
large momentum transfers, i.e. small distances, even bound quarks behave like 
free particles. The quark model does not explain the dynamics of quark bind- 
ing inside hadrons. This requires a dynamical theory of strong interactions, 
provided by Quantum Chromodynamics (see Section 9.2). The momentum 
distributions of quarks and gluons in the proton and neutron have been mea- 
sured with impressive precision [633] from 1992 to 2007 by the experiments 
H1 and ZEUS at the electron-proton collider HERA in Hamburg, Germany. 
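
The momentum budget quoted in the list of findings above can be checked with simple arithmetic. The following lines are a minimal sketch using only the two mean fractions given there for the proton; the variable names are chosen for this illustration.

```python
# Mean momentum fractions quoted in the text for the proton.
MEAN_FRACTION = {"u": 0.36, "d": 0.18}
VALENCE_COUNT = {"u": 2, "d": 1}  # proton = (uud)

quark_total = sum(MEAN_FRACTION.values())
remainder = 1.0 - quark_total  # attributed to gluons in the text
per_quark = {q: MEAN_FRACTION[q] / VALENCE_COUNT[q] for q in MEAN_FRACTION}

print(f"quarks: {quark_total:.2f}, remainder: {remainder:.2f}")
print(per_quark)
```

Each of the three valence quarks carries the same mean fraction, and their sum is close to one half, leaving the other half to the gluons.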
The results of the SLAC experiments in the late 1960s and similar findings
with neutrino and muon scattering off nucleons convinced most physicists of 
the existence of substructure inside hadrons, if not necessarily of the exis- 
tence of quarks as real particles. However, well into the 1970s there remained 
an influential sceptical minority under the leadership of Werner Heisenberg. 
In a plenary talk at the annual meeting of the German Physical Society 
in 1975 [427] (incidentally the first physics conference I attended), entitled
“What is an elementary particle?”, Heisenberg made clear that he considered
the quest for hadron constituents to be stillborn: “Here the question has obvi- 
ously been asked: what do protons consist of? But it has been forgotten, that 
the term ‘consist of’ makes sense only if one succeeds to dismantle the parti- 
cle, using a small amount of energy, into components with a mass much larger 
than this energy; otherwise, the word ‘consist of’ has lost its meaning.” This 
verdict corresponds to his deep conviction that only observables should enter 
into a theory of the physical world. And structure functions in his view did not 
qualify. As long as the dynamics of the binding forces were not understood, one was
not supposed to discuss constituents [427]: “One can exaggerate somewhat and 
say that good physics has been spoiled by bad philosophy.” Heisenberg’s opin- 
ion was that hadrons were to be understood as representations of a symmetry 
group; respecting certain conservation laws, any one could be transformed into 
any other, like the S-matrix has it. Quantum Chromodynamics, the field the- 
ory which precisely describes the dynamics of quarks and gluons but excludes 
the existence of free quarks, has completely refuted this ansatz. 

Admitting this exception, for most physicists at the end of the 1960s matter
consisted of the three quarks up, down and strange, as well as four leptons, the
charged electron and muon and the corresponding neutrinos, $\nu_e$ and $\nu_\mu$. Yet in
1964, James Bjorken and Sheldon Glashow [369] speculated that there ought
to be a fourth quark reestablishing the symmetry between leptons and quarks.
The explicit prediction in 1970 for this fourth quark, named charm, and its
properties is due to Glashow, John Iliopoulos and Luciano Maiani [406]. In a
big leap ahead, their aim was to show under which circumstances weak neutral
interactions (which were not yet discovered!) could avoid changing the flavour
quantum numbers of quarks. Such violations of flavour were already ruled out
by experiments on weak decays of hadrons.

Consequently, their prediction at first attracted little attention. That
changed promptly, when in November of 1974 the groups of Samuel C.C.
Ting at the Brookhaven AGS [419] and Burton Richter at SLAC [418]
simultaneously announced the discovery of a new heavy meson. Their two papers
appeared next to each other in Physical Review Letters. Ting called the new
particle $J$, Richter named it $\psi$. Since neither of them can claim precedence,
the meson is called $J/\psi$, the only particle with a double name. It is the
ground state of a bound pair of charm-anticharm quarks. The second
generation of quarks was thus completed. With this so-called November revolution,
the Eightfold Way arrived at its end, strong isospin and strangeness merged
with charm into the flavour quantum numbers. Richter and Ting were awarded
the 1976 Nobel prize in physics for their discovery.
As if that were not enough, a little later Martin L. Perl and his group at
the SPEAR electron-positron collider discovered a third charged lepton, the
tau [424]. Perl, the thesis advisor of Sam Ting, shared the 1995 Nobel prize
with Frederick Reines. The evidence was a significant number of $e^+e^-$
annihilations, where the observed final state contained nothing but an electron and a
muon of the opposite charge, in an apparent violation of the conserved lepton
number. In addition, the final state showed an imbalance of energy-momentum,
indicating additional unobserved particles in the final state. It was thus
clear that the reaction was indeed
$e^+e^- \to \tau^+\tau^- \to (e^+\nu_e\bar{\nu}_\tau)(\mu^-\bar{\nu}_\mu\nu_\tau)$, such
that the neutrinos, which escape detection, assure conservation of lepton
numbers and energy-momentum. The demonstration of tau neutrino interactions,
however, had to wait another 25 years [529, 659] because of their rarity
and minute cross section with matter. Anyway, nobody had really doubted
the existence of a third neutrino since the experiments at the LEP
electron-positron collider had determined the number of light neutrino species to be
exactly three⁸. To reestablish the symmetry between quarks and leptons, a
third generation of quarks also had to exist [413, 425]. The properties of these
quarks were known, but not their masses, so their discovery was a matter of
accelerator technology. The periodic system of quarks and leptons was finally
completed (as far as we know today) with the discovery of the third
generation bottom quark in 1977 [431] and the top quark in 1995 [477, 478], both at
the Fermi National Accelerator Laboratory, after an intensive experimental search guided
by theory.
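
The bookkeeping behind the tau discovery, an apparent violation of lepton number that is repaired by the unseen neutrinos, can be written down explicitly. The sketch below is an illustration only: the particle labels, the trailing `~` for antineutrinos and the function name are conventions invented here.

```python
# (electron number, muon number, tau number) for each particle.
LEPTON_NUMBER = {
    "e-": (1, 0, 0), "nu_e": (1, 0, 0),
    "mu-": (0, 1, 0), "nu_mu": (0, 1, 0),
    "tau-": (0, 0, 1), "nu_tau": (0, 0, 1),
}
# Positively charged leptons are the antiparticles of the negative ones.
ANTI = {"e+": "e-", "mu+": "mu-", "tau+": "tau-"}

def lepton_numbers(particles):
    """Total (L_e, L_mu, L_tau); a trailing '~' marks an antineutrino."""
    total = [0, 0, 0]
    for p in particles:
        sign = 1
        if p.endswith("~"):
            p, sign = p[:-1], -1
        elif p in ANTI:
            p, sign = ANTI[p], -1
        for i, n in enumerate(LEPTON_NUMBER[p]):
            total[i] += sign * n
    return tuple(total)

initial = lepton_numbers(["e-", "e+"])
visible = lepton_numbers(["e+", "mu-"])
complete = lepton_numbers(["e+", "nu_e", "nu_tau~", "mu-", "nu_mu~", "nu_tau"])
print(initial, visible, complete)
```

The visible final state alone does not match the initial state, whereas the complete final state including the four neutrinos does.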


Figure 9.4 The periodic table of quarks and leptons as it is known today. Rough
values of the masses are given above the particle symbol. Roman numerals denote
the generations. The twelve matter particles on the left are fermions, the four
force carriers on the right are bosons. [Figure panels: Quarks, Leptons, Bosons.]


⁸ For a review of these results see [625].


The periodic table of the quarks and leptons known today is shown in
Figure 9.4. The left side shows the twelve constituents of matter, which we are
entitled to take as elementary until further notice, i.e. point-like and without
substructure. They are classified into three generations with increasing masses,
and four families. The family members behave in a similar way as far as
interactions are concerned. The families are:


• quarks with electric charge $+\frac{2}{3}e$, also called up-type quarks;
• quarks with electric charge $-\frac{1}{3}e$, also called down-type quarks;
• neutrinos with electric charge 0; and
• charged leptons with electric charge $-e$.


For each particle there is an antiparticle with equal mass and opposite charges. 
The term charge not only denotes electric charge, but charges of all types, 
which belong to the other forces of the Standard Model, i.e. weak and strong 
charge. We will discuss these in the next Section. 
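
The classification into families and generations maps naturally onto a small data structure. The following sketch is an illustration, not the author's table: the class `Fermion`, the neutrino labels and the helper `antiparticle` are hypothetical names, and only the electric charge is modelled, not the weak and strong charges mentioned above.

```python
from dataclasses import dataclass
from fractions import Fraction

@dataclass(frozen=True)
class Fermion:
    name: str
    family: str
    generation: int
    charge: Fraction  # electric charge in units of e

# Family name -> (charge, members ordered by generation).
# The neutrino names are placeholders chosen for this sketch.
FAMILIES = {
    "up-type quark": (Fraction(2, 3), ["up", "charm", "top"]),
    "down-type quark": (Fraction(-1, 3), ["down", "strange", "bottom"]),
    "neutrino": (Fraction(0), ["neutrino 1", "neutrino 2", "neutrino 3"]),
    "charged lepton": (Fraction(-1), ["electron", "muon", "tau"]),
}

TABLE = [
    Fermion(name, family, generation, charge)
    for family, (charge, names) in FAMILIES.items()
    for generation, name in enumerate(names, start=1)
]

def antiparticle(f):
    """Same family and generation, opposite electric charge."""
    return Fermion("anti-" + f.name, f.family, f.generation, -f.charge)

print(len(TABLE), antiparticle(TABLE[0]))
```

Four families times three generations give the twelve matter particles of the table.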

In many such representations of the periodic system of quarks and leptons,
even those of the Particle Data Group, you will find the mixed states, $\nu_e$, $\nu_\mu$
and $\nu_\tau$, which interact with the W-boson, instead of the “real” neutrinos of
the three generations, $\nu_L$, $\nu_M$ and $\nu_H$. In the above table I list only eigenstates
of the mass operator, i.e. particles which evolve in space-time with a velocity
given by their energy-momentum⁹. Please excuse me for this somewhat
pedantic detail. That weak interactions couple to mixtures of particles (instead
of pure states) is a special feature of theirs which we will also discuss in the
next section.


9.2 FORCES 


Weak interactions are the only known forces which can transform quarks and 
leptons into different flavours, even across generations. It is thus understand- 
able that the periodic system of Figure 9.4 for elementary matter constituents 
emerged from the study of weak interactions. 

The first product of this force was observed by Henri Becquerel in 1896 (see 
Section 2.2), when he discovered the penetrating, negatively charged radiation 
which results from the beta decay of uranium salts. He read no less than 
seven papers to the French academy of sciences in 1896 and many more in the 
following years [107]. His work on radioactivity was recognised together with 
that of Marie and Pierre Curie with the Nobel prize in physics of 1903. 

Rutherford and his group developed the hypothesis that this beta radiation
comes from the decay of the atomic nucleus. Measurements of the spectrum
were made by Otto von Baeyer, Lise Meitner and Otto Hahn [137] in 1911 and
more precise ones by James Chadwick three years later in Hans Geiger’s
laboratory [153] at Physikalisch Technische Reichsanstalt in Berlin, using Geiger’s
newly developed counter. They demonstrated convincingly that the spectrum
of this type of radiation did not have a sharp peak, as one would expect for a
two-body decay. Rather it had a continuous distribution between zero and a
maximum energy. We know today that this maximum energy is the mass
difference between neutron and proton, but the neutron was only discovered in 1932
by the same James Chadwick [244]. The continuous spectrum led Wolfgang
Pauli in 1930 to the daring hypothesis that in nuclear beta decay an
additional, undetected particle was emitted. Edoardo Amaldi and Enrico Fermi
named it neutrino because of its electric neutrality and small mass [452]. The
negative decay product was identified by Maurice and Gertrude Goldhaber as
identical to the atomic electron [312]. The beta decay of the neutron is thus
the reaction $n \to p e^- \bar{\nu}_e$. The cross sections of neutrinos are so tiny¹⁰ that the
discovery of their interactions with matter had to wait for the availability of
nuclear reactors as intense sources of this elusive particle (see Section 9.1).


⁹ The upper limit on neutrino masses has recently been lowered to 1.1 eV, with a
precise measurement of the endpoint of the electron spectrum from tritium $\beta$ decay [682].
This determines an upper limit on $m^2 = |U_{eL}|^2 m_L^2 + |U_{eM}|^2 m_M^2 + |U_{eH}|^2 m_H^2$. See Focus
Box 9.14.

In 1934, Enrico Fermi formulated a theory of nuclear beta decay [260], 
closely following the example of early predecessors of quantum electrodynam- 
ics. In this theory, beta decay proceeds via a direct coupling of a nucleon 
current, which transforms the neutron into a proton, with a lepton current, 
which connects electron and electron antineutrino. No force particle for weak 
interactions was postulated. Thus no propagator enters the amplitude
calculation, rather a global constant represents the coupling, which is called the
Fermi constant, $G_F$. As long as momentum transfers between nucleon and
lepton current are small compared to the mass of the force quantum, this
effective field theory makes rather precise predictions. However, it does not
allow one to go beyond first order in perturbation theory.
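
Why a point-like coupling works so well at small momentum transfer can be illustrated by comparing a massive propagator with its low-energy limit. The sketch below rests on assumptions not made in the text: a spacelike momentum transfer with $Q^2 = -q^2 > 0$, a propagator of the form $1/(Q^2 + M^2)$, and a force quantum mass of 80.4 GeV, the measured mass of the charged weak boson. The function name is hypothetical.

```python
M_W = 80.4  # assumed mass of the charged weak boson in GeV (not quoted in the text)

def propagator_ratio(q_gev, mass_gev=M_W):
    """Ratio of a massive propagator 1/(Q^2 + M^2) to its contact limit 1/M^2.

    Q^2 = -q^2 > 0 is the spacelike momentum transfer squared.
    """
    return 1.0 / (1.0 + (q_gev / mass_gev) ** 2)

for q in (0.001, 1.0, 80.4):
    print(f"|q| = {q} GeV -> ratio {propagator_ratio(q):.6f}")
```

For the MeV-scale momentum transfers of nuclear beta decay the ratio differs from one by a negligible amount, so a single constant describes the coupling; only when the momentum transfer approaches the boson mass does the approximation break down.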

Hideki Yukawa [268] extended the theory by including the exchange of a
force quantum, with the corresponding propagator. To allow Fermi’s
approximation to work well, this particle had to be very heavy. At the time, “heavy”
meant a mass of several GeV; in reality weak bosons weigh almost 100 GeV.
Consequently, the theory of weak interactions remained an effective one for a long time.
What was astonishing, and still is, is the fact that weak interactions not only
violate flavour symmetry, but even symmetries as fundamental as parity, also called
mirror symmetry. The discrete parity transformation transforms three-vectors
into their opposite, e.g. the position vector $(x, y, z) \to (-x, -y, -z)$. In contrast,
it leaves pseudo-vectors like spin untouched. The potential violation of parity
symmetry was first discussed theoretically by Tsung-Dao Lee and Chen-Ning
Yang [334] in 1956. A year later, parity violation in $\beta$-decay was
experimentally demonstrated by Chien-Shiung Wu [338], as explained in Focus Box 9.4.
Lee and Yang obtained the Nobel prize in physics of 1957; Wu’s
experimental results were mentioned in the acceptance speech, but she was not among
the awardees. As if parity violation was not enough of a disrespect for
fundamental symmetries, it turned out that even the symmetry between matter
and antimatter, a combination of charge and space reversal called CP, is not
respected by weak interactions [380]. This violation of CP symmetry won the
1980 Nobel prize for its discoverers, James Cronin and Val Fitch. In
summary, weak interactions only couple to left-handed fermions, i.e. those with
spin opposite to the direction of motion, and right-handed antifermions, with
spin parallel to momentum, but not exactly in the same way.


¹⁰ Already in 1934 Bethe and Peierls calculated that neutrinos can easily cross the Earth
without interacting [257].


For their beta decay experiments, Chien-Shiung Wu and collaborators used a
heavy nucleus with strong nuclear spin, $^{60}$Co with $J = 5$. If one cools the
sample adiabatically in a strong magnetic field down to cryogenic temperatures,
all spins end up aligned in the direction of the magnetic field. Beta decay of
one of the nuclear neutrons can, respecting angular momentum conservation,
proceed in the two ways sketched below: a right-handed electron can be
emitted in the direction of the nuclear spin, or a left-handed electron is emitted
opposite to it. The corresponding angular distributions are:

$$
\left(\frac{d\Gamma}{d\Omega}\right)_a = \Gamma_0 \left(1 + \frac{\vec{\sigma}\cdot\vec{p}}{E}\right) \quad ; \quad \left(\frac{d\Gamma}{d\Omega}\right)_b = \Gamma_0 \left(1 - \frac{\vec{\sigma}\cdot\vec{p}}{E}\right)
$$

with the spin vector $\vec{\sigma}$, the electron momentum vector $\vec{p}$ and the electron
energy $E$. Only the latter distribution is observed, the electron is preferentially
emitted opposite to the direction of the nuclear spin.

Conservation of parity would require that both configurations are realised
with the same probability. Any deviation from equality would indicate a
violation of parity. That only configuration b) is observed means that parity
is maximally violated. Only left-handed fermions and right-handed
antifermions participate in weak interactions. This does, however, not mean
that Noether’s theorem is violated. Since parity is a discrete transformation,
Noether’s theorem does not protect the corresponding quantum number.

[Sketch of the two decay configurations a) and b), each leaving a nucleus with spin $J = 4$.]


Focus Box 9.4: Wu experiment on parity violation
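
The two angular distributions of Focus Box 9.4 translate into a forward-backward asymmetry that an experiment can count. The sketch below is an illustration under stated assumptions: fully aligned nuclear spins, an electron velocity $\beta = p/E$, and a distribution of the form $1 \pm \beta\cos\theta$ with $\theta$ the angle between electron momentum and nuclear spin. The function names are invented for this example.

```python
def forward_backward_asymmetry(sign, beta, steps=100_000):
    """Asymmetry of dGamma/dOmega = Gamma0 * (1 + sign * beta * cos(theta)).

    sign = +1 corresponds to configuration a), sign = -1 to configuration b);
    beta = p/E is the electron velocity. The integral over cos(theta) is
    evaluated with the midpoint rule.
    """
    forward = backward = 0.0
    dc = 1.0 / steps
    for i in range(steps):
        c = (i + 0.5) * dc  # cos(theta) in (0, 1)
        forward += (1.0 + sign * beta * c) * dc
        backward += (1.0 - sign * beta * c) * dc
    return (forward - backward) / (forward + backward)

def parity_mirror(sign):
    """Parity flips the momentum but not the spin, so sigma.p changes sign."""
    return -sign

print(forward_backward_asymmetry(-1, 1.0))
print(forward_backward_asymmetry(parity_mirror(-1), 1.0))
```

The mirror image of configuration b) is configuration a), with the opposite asymmetry; observing only one sign of the asymmetry is what signals the violation of parity.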

A field theory for weak interactions has existed since the work of Steven
Weinberg [396], Sheldon Glashow [352] and Abdus Salam [398]. In the introduction
to his publication, Weinberg names his motivation: “Leptons interact only with 
photons, and with the intermediate bosons that presumably mediate weak 
interactions. What could be more natural than to unite these spin-one bosons 
into a multiplet of gauge fields?” The term “gauge” shows up again here, like it 
did in our discussion of electrodynamics. In order to understand why the free- 
dom to gauge potentials in a range of different ways is of fundamental impor- 
tance, we must discuss Emmy Noether’s theorem [176, 407]. The theorem 
says that for each invariance of a physical system under a group of continuous 
symmetry transformations, there is a conserved quantum number. An example
from classical physics is the connection between energy-momentum
conservation on one hand, and invariance under translations in time and space on
the other hand. Another is the gauge freedom of the electromagnetic field,
which leads to conservation of the electric charge. These examples are detailed
in Focus Box 9.6.

The proof of this theorem is due to Amalie Emmy Noether, a genius math- 
ematician from the city of Erlangen in Germany and one of the first female 
students and lecturers at its university. Even though women could not be 
enrolled at the time, she was given permission in 1900 to attend lectures, 
including those by eminent mathematicians like Schwarzschild, Minkowski, 
Klein and Hilbert. In 1904 women were finally officially admitted. Noether
obtained her doctorate in 1908 and her habilitation in 1919, and was given an
unpaid position as extraordinary professor. In 1933, she had to leave university like all
other researchers of Jewish origin, and emigrated to the U.S., where she taught 
at Bryn Mawr College. She died, as Hermann Weyl [267] remarked, “at the 
summit of her mathematical creative power.” The European Physical Society 
has named its prize for extraordinary female physicists after this unique per- 
sonality. More about her biography and her contributions to physics can be 
found in [482]. 

The symmetry properties of physical systems are best discussed using the 
Lagrange formulation as explained in Focus Box 9.5 for classical mechanics and 
in Focus Box 9.7 for quantum fields. At the end of the 18th century, Joseph- 
Louis Lagrange invented a single scalar function [5], or operator in quantum 
physics, which contains all necessary information to predict the evolution of 
a system. The so-called Euler-Lagrange equations transform the function into 
equations of motion for the system. The interesting feature of the Lagrange 
function or operator is that its invariance under a given set of transformations 
is transmitted to the equations of motion. As shown in Focus Box 9.7, the 
Lagrange operators for free bosons and fermions are all scalar under Lorentz 
A classical system can be characterised by a single scalar function, the
Lagrange function, $L(q, \dot{q}, t)$, which was introduced by the Italian-French
mathematician and astronomer Joseph-Louis Lagrange in 1788 [5]. It can
depend on the generalised coordinates $q$, which do not need to be Cartesian,
on the velocities $\dot{q}$ and also explicitly on time $t$. The classical action $S$
results from the Lagrange function by time integration:

$$S = \int_{t_1}^{t_2} dt\, L(q, \dot{q}, t)$$

The equations of motion result from Hamilton’s principle of minimal action,
which postulates that $S$ is stationary for the physical trajectory which the
system will choose for its evolution between $t_1$ and $t_2$:

$$\delta S = \delta \int_{t_1}^{t_2} dt\, L(q, \dot{q}) = 0$$

The variation symbol $\delta$ denotes the difference between two possible
trajectories, such that the variation of coordinates is zero at the start and end
point, $\delta q_i(t_1) = \delta q_i(t_2) = 0$. If $L$ does not explicitly depend on time, one
obtains from Hamilton’s principle the Euler-Lagrange equations:

$$\frac{d}{dt}\left(\frac{\partial L}{\partial \dot{q}_i}\right) - \frac{\partial L}{\partial q_i} = 0$$

For conservative systems (see Focus Box 2.2), $L$ is the difference between
kinetic energy $T$ and potential energy $V$, $L = T - V$. It thus has the dimension
of an energy. The Euler-Lagrange equations connect the Lagrange function to
unique equations of motion. It follows that a transformation which leaves $L$
invariant does not change the evolution of the system; it represents one of its
symmetries.
Let us take as an example a particle of mass $m$ in a one-dimensional system.
Assuming a potential $V(x)$, its Lagrange function is $L = \frac{1}{2} m \dot{x}^2 - V(x)$. The
Euler-Lagrange equations then lead to the well-known Newtonian equations of
motion for a conservative force $F$ (see Focus Box 2.1), $m\ddot{x} = -\partial V/\partial x = F(x)$.
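
Hamilton's principle can be illustrated numerically. The following is a minimal sketch, not part of the original text: it assumes a harmonic oscillator with $m = k = 1$, whose physical trajectory between $x(0) = 0$ and $x(1) = \sin 1$ is $x(t) = \sin t$, and compares the discretised action of this trajectory with that of trial paths deformed by a term $\epsilon \sin(\pi t)$ that vanishes at both end points. The function names and the step count are illustrative choices. Since the time interval is shorter than half a period, the stationary action is here also a minimum.

```python
import math


def harmonic_action(path, dt, mass=1.0, spring=1.0):
    """Discretised action S = sum (T - V) dt along a sampled 1D path."""
    total = 0.0
    for x_start, x_end in zip(path[:-1], path[1:]):
        velocity = (x_end - x_start) / dt          # finite-difference velocity
        x_mid = 0.5 * (x_start + x_end)            # midpoint for the potential
        kinetic = 0.5 * mass * velocity ** 2
        potential = 0.5 * spring * x_mid ** 2
        total += (kinetic - potential) * dt
    return total


def action_of_trial_path(epsilon, n_steps=1000, t_end=1.0):
    """Action of x(t) = sin(t) + epsilon * sin(pi t / t_end).

    The deformation vanishes at t = 0 and t = t_end, as required for
    the variation in Hamilton's principle.
    """
    dt = t_end / n_steps
    path = [
        math.sin(i * dt) + epsilon * math.sin(math.pi * i * dt / t_end)
        for i in range(n_steps + 1)
    ]
    return harmonic_action(path, dt)


if __name__ == "__main__":
    for epsilon in (-0.1, 0.0, 0.1):
        print(f"epsilon = {epsilon:+.1f}  S = {action_of_trial_path(epsilon):.6f}")
```

The undeformed path reproduces the analytic value $S = \frac{1}{4}\sin 2$, and both deformed paths give a larger action, with a change that is quadratic in $\epsilon$ as expected for a stationary point.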


Focus Box 9.5: Lagrange mechanics 
Noether’s theorem says that continuous symmetries of a system result in a
conservation law for one of its properties. Energy-momentum conservation
follows from the invariance of physical laws under translations in space and
time. This fact is well known for classical systems and is best discussed in
the framework of Lagrangian mechanics, introduced in Focus Box 9.5. Let us
consider a conservative system, where the potential $V$ only depends on the
position vector $q_i$ ($i = 1, 2, 3$) of its ingredients, and is independent of their
velocity vector $\dot{q}_i$. For those, the Lagrange function is $L = T(\dot{q}_i) - V(q_i)$. A
shift in coordinate position along an axis $i$ does not change the kinetic energy
$T$. The Euler-Lagrange equations thus simplify to:

$$\frac{d}{dt}\left(\frac{\partial T}{\partial \dot{q}_i}\right) = -\frac{\partial V}{\partial q_i}$$

On the left side of the equation you see the time derivative of the momentum, 
on the right side the gradient of the potential, i.e. the force. If the potential 
does not explicitly depend on a given coordinate q;, the Lagrange function is 
invariant under shifts in this direction and the right hand side of this equation 
is zero. The corresponding component of the momentum is then a constant 
of motion. Translational symmetry of the system has led to momentum con- 
servation. In an analogous way, invariance against time shifts leads to energy 
conservation, invariance against rotations to conservation of angular momen- 
tum. 

The gauge freedom of the electromagnetic potential leads to the conservation
of charge. One can see that using a simple Gedankenexperiment. Two
observers look at an electron at rest in an isolated box. One observer defines
the electric potential at the position of the electron as zero volts, the other as
5 V; gauge freedom entitles them to do so. If one could make the electron
disappear, without introducing any additional particle, the second observer
would gain 5 eV of energy, while the first would gain nothing. So if energy is
conserved, gauge freedom leads to the conservation of electric charge. Of course
one can make the electron disappear by introducing a positron into the box,
without violating energy conservation: for the first observer, this would require
no energy (if done adiabatically), the second one would have to work against
the potential.
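
The link between translational symmetry and momentum conservation can also be checked numerically. The sketch below is an illustration under stated assumptions, not taken from the original: two particles on a line interact through a potential $V = \frac{1}{2}k(x_1 - x_2)^2$, which depends only on their distance and is therefore unchanged when both are shifted together. The masses, the spring constant and the velocity-Verlet integrator are arbitrary illustrative choices.

```python
def simulate_two_body(n_steps=10000, dt=1e-3, spring=2.0, m1=1.0, m2=3.0):
    """Integrate two particles coupled by V = 0.5 * spring * (x1 - x2)**2.

    Returns (total momentum at start, total momentum at end,
             momentum of particle 1 at start, momentum of particle 1 at end).
    """
    x1, x2 = 0.0, 1.5
    v1, v2 = 0.4, -0.1

    def forces(pos1, pos2):
        # F1 = -dV/dx1, and F2 = -dV/dx2 = -F1 because V depends on x1 - x2 only
        f1 = -spring * (pos1 - pos2)
        return f1, -f1

    p_total_start = m1 * v1 + m2 * v2
    p1_start = m1 * v1
    f1, f2 = forces(x1, x2)
    for _ in range(n_steps):
        # velocity-Verlet step: half kick, drift, half kick
        v1 += 0.5 * dt * f1 / m1
        v2 += 0.5 * dt * f2 / m2
        x1 += dt * v1
        x2 += dt * v2
        f1, f2 = forces(x1, x2)
        v1 += 0.5 * dt * f1 / m1
        v2 += 0.5 * dt * f2 / m2
    return p_total_start, m1 * v1 + m2 * v2, p1_start, m1 * v1


if __name__ == "__main__":
    p_start, p_end, p1_start, p1_end = simulate_two_body()
    print("total momentum:", p_start, "->", p_end)
    print("particle 1 momentum:", p1_start, "->", p1_end)
```

The momentum of each particle changes considerably during the motion, while the total momentum stays constant up to rounding errors, as expected for a Lagrange function that is invariant under a common shift of all coordinates.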


Focus Box 9.6: Noether’s theorem in classical physics 
transformations. Consequently, the derived equations of motion are covariant
and can describe relativistic particles. In the light of Noether’s theorem we 
can say that the invariance of the Lagrange function under Lorentz transfor- 
mations causes energy-momentum conservation in relativistic terms. 

Lagrange operators can be used to describe the interaction of particles. 
And again, the properties of the interaction follow from a symmetry principle.
The symmetries we have thus far considered concern the coordinates with
which the fields are described, thus transformations of the type $\psi(x) \to \psi(x')$.
We find it natural that the choice of the coordinate system does not change
the physics one describes. We will now consider changes to the fields themselves,
thus transformations of the type $\psi(x) \to \psi'(x)$. A well-known example
from classical physics is the gauge transformation of the electromagnetic
potential, discussed in Focus Box 2.3 and already used when discussing
electromagnetic waves in Focus Box 2.4. It is shown there that there is a class of
global transformations of the electromagnetic potential which leave the elec- 
tric and magnetic fields unchanged. They are called global since they concern 
all potentials alike, everywhere and for all times. Take the electric potential as 
an example: only potential differences matter, not the absolute values. Thus 
the 5V marked on your battery denote the difference in potential between the 
two poles; there is no absolute zero of the potential other than by convention. 
Focus Box 9.6 explains that this symmetry is responsible for the conservation 
of electric charge. 

However, the experiments à la Aharonov and Bohm, which we introduced 
in Focus Box 7.9, convinced us that electromagnetic potentials are not arbi- 
trary. They change the phase angle of matter fields. But since only phase differ- 
ences can be made observable by interference experiments, we are not worried 
about that. Noether’s theorem says that there ought to be a conserved quan- 
tity caused by this global symmetry. Focus Box 9.8 shows that it is the elec- 
tromagnetic four-vector current density which is conserved. The local charge 
density can only change when a divergent or convergent current is present; 
global gauge invariance leads to local charge conservation. Remember that the 
equations of motion of the fields, the Klein-Gordon and the Dirac equations, 
already followed from the requirement of covariance under Lorentz transfor- 
mations, which leads to the conservation of energy-momentum. Clearly, sym- 
metries have momentous consequences for a field theory. 

Modern field theories of fundamental interactions take this argument a 
step further. Until now, we have considered only global gauge changes to the 
potentials, leading to global phase changes of all matter fields. These stay 
unobservable, since only phase differences can be made apparent in interference
experiments. Taking the squared modulus of the wave function makes the absolute
phase disappear when the probability for finding the particle is calculated.
Modern gauge theories make the more radical requirement that this is also
true when the gauge of the potential is changed locally, leading to arbitrary
phase changes of the matter fields in space and time. These are benign only if
we add a well defined interaction term to the Lagrange density. As shown in
The Lagrangian formulation of a quantum field theory allows summarising the
properties of a system of particles by a single scalar operator, the Lagrange
operator. The principal difference to Lagrange mechanics (see Focus Box 9.5)
is that one no longer uses discrete coordinates $q_i(t)$ but continuous fields
$\psi(t, \vec{x})$. The field can be seen as a generalised coordinate at each value of its
argument, the space-time coordinates. This then represents a system with an
infinite number of degrees of freedom. The description of the system is based
on the Lagrange density operator, $\mathcal{L} = \mathcal{L}(\psi, \partial_\mu \psi)$, which depends on the field
$\psi(x)$ and its four-gradient $\partial_\mu \psi(x)$. The action integral thus becomes:

$$S = \int_{t_1}^{t_2} dt \int d^3x\, \mathcal{L}(\psi, \partial_\mu \psi)$$

The inner integral changes the Lagrange density into the Lagrange operator,
$L = \int d^3x\, \mathcal{L}(\psi, \partial_\mu \psi)$. Hamilton’s principle of minimal action determines the
evolution of the system:

$$\delta \int d^4x\, \mathcal{L}(\psi, \partial_\mu \psi) = 0$$

with initial and final conditions $\delta\psi(x_1) = \delta\psi(x_2) = 0$. By the same reasoning
as for classical systems, one obtains the Euler-Lagrange equations for a field
theory:

$$\partial_\mu \left( \frac{\partial \mathcal{L}}{\partial(\partial_\mu \psi)} \right) - \frac{\partial \mathcal{L}}{\partial \psi} = 0$$

They lead to the equations of motion for the fields, which inherit invariance
properties from symmetries of the Lagrange operator. When $\mathcal{L}$ is a scalar
operator under Lorentz transformations, thus relativistically invariant, the
equations of motion will be covariant. Examples:

• The Lagrange density which leads to the Klein-Gordon equation (see
Focus Box 7.2) for a free scalar boson is $\mathcal{L}_{S=0} = (\partial_\mu \psi)^* (\partial^\mu \psi) - m^2 \psi^* \psi$.

• For the spinor field of a free fermion, the Lagrange density $\mathcal{L}_{S=1/2} =
\bar{\psi} (i\gamma^\mu \partial_\mu - m) \psi$, with $\bar{\psi}(x) = \psi^\dagger(x)\gamma^0$, leads to the Dirac equation (see
Focus Box 7.4).

• A free vector boson like the photon $A_\mu$ has a Lagrange density $\mathcal{L}_{S=1} =
-\frac{1}{4} F_{\mu\nu} F^{\mu\nu}$ with the field tensor $F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu$ (see Focus Box 3.7).

All terms above are products of four-vectors or scalars, and thus fulfil the
invariance condition for Lorentz transformations.
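
As a worked check of the scalar example, one may insert the Klein-Gordon Lagrange density into the Euler-Lagrange equations. The short derivation below is a sketch added for illustration; it uses the common convention of treating $\psi$ and $\psi^*$ as independent fields and varying with respect to $\psi^*$.

```latex
\begin{align*}
\mathcal{L}_{S=0} &= (\partial_\mu \psi)^* (\partial^\mu \psi) - m^2 \psi^* \psi \\
\frac{\partial \mathcal{L}_{S=0}}{\partial(\partial_\mu \psi^*)} &= \partial^\mu \psi ,
\qquad
\frac{\partial \mathcal{L}_{S=0}}{\partial \psi^*} = -m^2 \psi \\
\partial_\mu \left( \frac{\partial \mathcal{L}_{S=0}}{\partial(\partial_\mu \psi^*)} \right)
 - \frac{\partial \mathcal{L}_{S=0}}{\partial \psi^*} &= 0
\quad\Longrightarrow\quad
\left( \partial_\mu \partial^\mu + m^2 \right) \psi = 0
\end{align*}
```

The result is the Klein-Gordon equation; varying with respect to $\psi$ instead gives the same equation for $\psi^*$.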


Focus Box 9.7: Lagrange formalism for free quantum fields 
A global gauge transformation changes the phase of matter fields $\psi$:

$$\psi(x) \to \psi'(x) = e^{i\alpha Q}\, \psi(x)$$

with a constant angle $\alpha \neq \alpha(x)$ and the charge operator $Q$ with $Q\psi(x) =
q\psi(x)$. The same angle applies to all fields and all of space-time. We will
show invariance for the example of a complex scalar field $\psi(x)$. Its equation
of motion follows from a Lagrange density (see Focus Box 9.7):

$$\mathcal{L} = (\partial_\mu \psi)^* (\partial^\mu \psi) - m^2 \psi^* \psi$$

The global gauge transformations form the Lie group U(1), rotations around
a single axis, defined by infinitesimal transformations:

$$\psi \to (1 + i\alpha Q)\psi$$

Invariance of the Lagrange density means that the change in $\mathcal{L}$ is zero. For an
electron, with $Q\psi = -e\psi$:

$$\delta\mathcal{L} = -ie\alpha \left[ \frac{\partial \mathcal{L}}{\partial \psi} - \partial_\mu \frac{\partial \mathcal{L}}{\partial(\partial_\mu \psi)} \right] \psi - ie\alpha\, \partial_\mu \left( \frac{\partial \mathcal{L}}{\partial(\partial_\mu \psi)}\, \psi \right) + \ldots = 0$$

where the dots indicate the complex conjugates of the first two terms. The
first term and its complex conjugate disappear because of the Euler-Lagrange
equation. Invariance thus requires that the second term and its complex
conjugate add up to zero for all values of $\alpha$:

$$\alpha\, \partial_\mu \left( ie \left[ \frac{\partial \mathcal{L}}{\partial(\partial_\mu \psi)}\, \psi - \psi^* \frac{\partial \mathcal{L}}{\partial(\partial_\mu \psi^*)} \right] \right) = 0$$

The term in parentheses is the electromagnetic current density (see Focus
Box 3.7):

$$j^\mu = -ie \left( \psi^* \partial^\mu \psi - \psi\, \partial^\mu \psi^* \right)$$

It must thus be conserved, $\partial_\mu j^\mu = 0$. The time-like component, $j^0 =
-ie (\psi^* \partial_t \psi - \psi\, \partial_t \psi^*)$, is the local charge density $-e\rho$, with $\rho$ itself the
particle number density. The spatial components, $\vec{j} = ie (\psi^* \vec{\nabla} \psi - \psi \vec{\nabla} \psi^*)$, form the
vector of electromagnetic current density. The continuity equation $\partial_\mu j^\mu = 0$
thus means that the local charge density can only change when a divergent
or convergent current of charges is present. Integrating over all space one sees
that the total charge of the Universe is conserved, $\frac{dQ}{dt} = \frac{d}{dt} \int j^0\, d^3x = 0$.
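
The continuity equation can be verified numerically for an explicit solution. The following sketch rests on assumptions that are not part of the original text: it works in one space dimension with natural units, takes a superposition of two plane waves that each satisfy the Klein-Gordon equation, and uses the current $j^\mu = iq(\psi^* \partial^\mu \psi - \psi\, \partial^\mu \psi^*)$ with $q = -1$ in units of $e$. All derivatives are evaluated analytically; the mode amplitudes and wave numbers are arbitrary.

```python
import cmath
import math

MASS = 1.0  # field mass in natural units (illustrative value)


def field_and_derivatives(t, x, modes):
    """Return psi and its second derivatives for a sum of on-shell plane waves.

    modes is a list of (amplitude, wave_number) pairs.
    """
    psi = psi_tt = psi_xx = 0j
    for amplitude, k in modes:
        energy = math.sqrt(k * k + MASS * MASS)   # E^2 = k^2 + m^2
        wave = amplitude * cmath.exp(-1j * (energy * t - k * x))
        psi += wave
        psi_tt += -(energy ** 2) * wave            # second time derivative
        psi_xx += -(k ** 2) * wave                 # second space derivative
    return psi, psi_tt, psi_xx


def continuity_terms(t, x, modes, charge=-1.0):
    """Return (d_t j^0, d_x j^x) for the Klein-Gordon current.

    j^0 = i q (psi* d_t psi - psi d_t psi*)
    j^x = -i q (psi* d_x psi - psi d_x psi*)
    The products of first derivatives cancel when differentiating once more.
    """
    psi, psi_tt, psi_xx = field_and_derivatives(t, x, modes)
    dt_j0 = 1j * charge * (psi.conjugate() * psi_tt - psi * psi_tt.conjugate())
    dx_jx = -1j * charge * (psi.conjugate() * psi_xx - psi * psi_xx.conjugate())
    return dt_j0.real, dx_jx.real


if __name__ == "__main__":
    modes = [(1.0, 0.5), (0.7, -1.2)]
    dt_j0, dx_jx = continuity_terms(0.3, 0.7, modes)
    print("d_t j^0 =", dt_j0)
    print("d_x j^x =", dx_jx)
    print("sum     =", dt_j0 + dx_jx)
```

The charge density changes in time at the chosen point, but this change is exactly compensated by the divergence of the current, so the sum vanishes up to rounding.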


Focus Box 9.8: Global U(1) gauge symmetry and conserved current 
Focus Box 9.9 it must have exactly the form corresponding to Maxwell’s equations.
To preserve local gauge invariance, photons must thus always accompany
charged particles wherever they are, to react appropriately to local phase
changes and keep them from becoming observable. One thus calls the photon 
a gauge boson. The Feynman graphs of elementary interactions between pho- 
tons and matter, which result from this requirement, are shown on the bottom 
of Figure 9.5. 


The Lagrange density of a free fermion (see Focus Box 9.7) is not invariant
against local gauge transformations, $\psi(x) \to \psi'(x) = e^{i\alpha(x)Q}\psi(x)$, because of
its term $\partial_\mu$. It becomes invariant if and only if one adds an interaction term:

$$\mathcal{L} = \bar{\psi} (i\gamma^\mu \partial_\mu - m) \psi + e \bar{\psi} \gamma^\mu \psi A_\mu$$

The four-vector $A_\mu$ must transform like $A_\mu \to A_\mu + (1/e)\,\partial_\mu \alpha(x)$ to preserve
invariance. We can thus identify $A_\mu$ as the electromagnetic four-vector
potential, which has exactly this gauge freedom: the four-vector gradient of
an arbitrary function can be added to it. Invariance is thus guaranteed if an
electromagnetic potential is present wherever there is a charged particle. And
the interaction between the two is not arbitrary. The electromagnetic current
density must couple to the photon field just as Maxwell’s equations have it.
Without spoiling local gauge invariance we can add the kinetic term for the
free photon and obtain the Lagrange density of Quantum Electrodynamics:

$$\mathcal{L}_{QED} = \bar{\psi} (i\gamma^\mu \partial_\mu - m) \psi + e \bar{\psi} \gamma^\mu \psi A_\mu - \frac{1}{4} F_{\mu\nu} F^{\mu\nu}$$


Thus all of QED can be derived from invariance requirements. 
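
The cancellation behind this statement can be written out in a few lines. The derivation below is a sketch added for illustration; as a simplifying assumption it sets the eigenvalue of the charge operator $Q$ to one, so that $\psi' = e^{i\alpha(x)}\psi$ and $\bar{\psi}' = \bar{\psi}\, e^{-i\alpha(x)}$.

```latex
\begin{align*}
\bar{\psi}'\, i\gamma^\mu \partial_\mu \psi'
  &= \bar{\psi}\, i\gamma^\mu \partial_\mu \psi
     - \bar{\psi} \gamma^\mu \psi \; \partial_\mu \alpha(x) \\
e\, \bar{\psi}' \gamma^\mu \psi' A'_\mu
  &= e\, \bar{\psi} \gamma^\mu \psi \left( A_\mu + \tfrac{1}{e}\, \partial_\mu \alpha(x) \right)
   = e\, \bar{\psi} \gamma^\mu \psi A_\mu
     + \bar{\psi} \gamma^\mu \psi \; \partial_\mu \alpha(x) \\
F'_{\mu\nu}
  &= \partial_\mu A'_\nu - \partial_\nu A'_\mu
   = F_{\mu\nu} + \tfrac{1}{e} \left( \partial_\mu \partial_\nu - \partial_\nu \partial_\mu \right) \alpha(x)
   = F_{\mu\nu}
\end{align*}
```

The extra terms proportional to $\partial_\mu \alpha(x)$ in the first two lines cancel in the sum, the mass term $m \bar{\psi}\psi$ is invariant by itself, and the field tensor does not change, so the complete Lagrange density is unchanged.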


Focus Box 9.9: Local U(1) gauge symmetry and electron-photon interaction 


One could discount this fact purely as an aesthetically pleasing feature, if 
it did not have far reaching consequences. In the beginning of the 1970s, Ger- 
ard ’t Hooft and Martinus “Tini” Veltman found a connection between gauge 
invariance and the renormalisability of a field theory. Renormalisation and 
regularisation are part of the systematic treatment of infinities which show 
up in field theories when higher order processes are included. They should in 
principle disappear when one sums over all orders of perturbation theory. In 
practical calculation they are absorbed into measured parameters like masses 
and coupling constants, which are known and finite. Veltman and ’t Hooft 
demonstrated that gauge theories with massless gauge bosons are renormal- 
isable. It was thus shown that gauge invariance leads to theories which are 
predictive, in principle up to arbitrary order and thus high precision, once 
their parameters are precisely measured. Veltman and his former graduate 
student ’t Hooft received the physics Nobel prize in 1999 for this important 
proof. The two men are of very different character. Veltman, outspoken and 
never shy of controversy, refused to tell their individual contributions apart 
[500, p. 168]: “I once said to ’t Hooft that once you collaborate you forget
about who invented what.” ’T Hooft, soft spoken but equally forceful, took 
offence and the two drifted apart. 

With this discovery, the interest of the theory community in field theories 
of weak and strong interactions was revived. A class of such theories had been 
proposed twenty years earlier by Chen-Ning Yang and Robert L. Mills from 
Brookhaven in a paper [328] which represents a highlight in the field, really 
a course on gauge theories in a nutshell. It would be superficial to think that 
renormalisability is just a practical property, allowing one to increase precision in
a perturbative approach. Rather, effective theories, like the one by Fermi for
weak interactions, already show divergences in second order terms, justifying
criticism against field theory on grounds of principle. These arguments were now
removed and the end of S-matrix theory was near. 
Figure 9.5 Feynman graphs for the vertices of electromagnetic and weak interactions
of leptons $l$ and quarks $q$. Downwards from the top: charged weak interactions,
neutral weak interactions and electromagnetic interactions. Because of
their maximal parity violation, weak interactions only act on the left-handed
components of the fields, indicated by the index L. Above the vertices the
coupling constants are noted.


Weinberg had already conjectured in 1967 [396], that his unified gauge 
theory of electromagnetic and weak interactions might be renormalisable. He 
classified matter and force particles in representations of the joint symmetry
group SU(2)×U(1) which combines rotations around two and one axes,


198 m Particles, Fields, Space-Time 


respectively. Here the transformations of SU(2) again change the phase of 
matter fields, but apply only to their left-handed components. The conse- 
quence is a conserved weak charge, called weak isospin (T,T3) in analogy to 
spin, which is explained in Focus Box 9.10. The U(1) transformations, which 
belong to electromagnetism, of course apply to the whole fields. 

Figure 9.5 shows the Feynman graphs of elementary weak interaction ver- 
tices together with those of electromagnetic interactions. They all conserve 
weak isospin as well as electric charge. The neutral interactions, mediated by
the Z boson or photon $\gamma$ and also called neutral currents, do not change the
charge or the flavour quantum number of matter particles. The charged weak
interactions, interactions with the $W^\pm$ bosons, also called charged currents,
change a charged lepton into its neutrino, e.g. $\mu^- \to \nu_\mu$, or a quark of type
up into a quark of type down, like $c \to s$, not necessarily of the same generation.
The coupling constants of all three interaction types are of similar size.
However, weak interactions are mediated by the heavy bosons W and Z with 
masses of the order of 100 GeV. Compared to the massless photon, ampli- 
tudes and cross sections are heavily suppressed by propagators, at least for 
momentum transfers much less than these masses. The three interactions are 
thus described by a unified theory, called electroweak theory, which reduces 
three couplings to one and a mixing angle, as shown in Focus Box 9.10. 


Figure 9.6 Feynman diagrams for typical electroweak interactions of leptons.
From left to right: electron-positron annihilation into a pair of muons
($e^+e^- \to \mu^+\mu^-$); muon decay into an electron and two neutrinos
($\mu^- \to e^- \bar{\nu}_e \nu_\mu$); elastic muon neutrino scattering off an electron ($\nu_\mu e^- \to \nu_\mu e^-$).


Figure 9.6 shows Feynman graphs for typical leptonic electroweak inter- 
actions. Wherever initial and final state allow, like in the electron-positron 
annihilation of the left graph, photon as well as Z exchange contribute to 
the invariant amplitude. The cross section then has three terms, correspond- 
ing to the probabilities for the two mechanisms plus an interference term. 
Muon decay, shown in the middle graph, is the mother of all charged weak 
interactions; it serves today to determine the Fermi constant with high
precision¹¹. The rightmost process, neutrino-electron scattering, only proceeds


¹¹ See the discussion by the Particle Data Group,
http://pdg.lbl.gov/2019/reviews/rpp2019-rev-standard-model.pdf, p. 3.
The Lagrange density of free fermions is invariant against global SU(2)
transformations, as it was for U(1). The corresponding conserved quantity is the
weak charge with two components, called weak isospin. In analogy to spin,
weak isospin is specified by its absolute value $T$ and its third component $T_3$.
The weak charges of leptons then are:

$$\begin{pmatrix} \nu_e \\ e^- \end{pmatrix}_L,\ \begin{pmatrix} \nu_\mu \\ \mu^- \end{pmatrix}_L,\ \begin{pmatrix} \nu_\tau \\ \tau^- \end{pmatrix}_L:\quad T = 1/2;\ T_3 = \begin{cases} +1/2 \\ -1/2 \end{cases} \qquad e^-_R,\ \mu^-_R,\ \tau^-_R:\quad T = 0;\ T_3 = 0$$

This way, only the doublets of the left-handed field components have a weak
charge, the singlets of right-handed components don’t, in agreement with
maximum parity violation. In the same way, the quark charges are:

$$\begin{pmatrix} u \\ d \end{pmatrix}_L,\ \begin{pmatrix} c \\ s \end{pmatrix}_L:\quad T = 1/2;\ T_3 = \begin{cases} +1/2 \\ -1/2 \end{cases} \qquad u_R, d_R,\ c_R, s_R:\quad T = 0;\ T_3 = 0$$

Local gauge invariance is ensured if the Lagrange density contains interaction
terms between matter fields and three gauge bosons, $(Z, W^\pm)$, which
form a triplet of weak isospin, $(W^+, Z, W^-)$ with $T = 1$, $T_3 = (+1, 0, -1)$.
The coupling constants of weak charged ($g$), weak neutral ($g'$) and
electromagnetic interactions ($e$) are quite similar and connected by the unification
relation $e = g \sin\theta_W = g' \cos\theta_W$. It uses the so-called weak or Weinberg angle
$\cos\theta_W = M_W/M_Z$, $\theta_W \approx 30°$. At small momentum transfer $q^2$, like in weak
decays, amplitudes are suppressed by propagator effects. Instead of $1/q^2$ for
the massless photon, the weak propagator is $1/(q^2 - M^2 + iM\Gamma)$, with masses
$M_Z \approx 90$ GeV, $M_W \approx 80$ GeV and widths $\Gamma_Z \approx 2.5$ GeV and $\Gamma_W \approx 2.0$ GeV.
Weak cross sections are thus small for $q^2 \ll M^2$, with an effective coupling
like in the Fermi theory:

$$\frac{G_F}{\sqrt{2}} = \frac{g^2}{8 M_W^2}$$

At momentum transfers comparable to the boson masses, $q^2 \simeq M^2$, weak cross
sections have a resonance pole and become much larger than electromagnetic
ones. However there is an inherent problem. Local gauge invariance is only
preserved if all gauge bosons are massless like the photon. But Z and $W^\pm$
clearly aren’t.
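
The relations in this box can be tried out with numbers. The sketch below is an illustration under assumptions not contained in the original: it uses the low-energy fine-structure constant, the convention $e^2 = 4\pi\alpha$ in natural units, and rounded measured boson masses and Z width as inputs. It works at lowest order only, so an agreement with the measured Fermi constant at the level of a few percent is all one should expect.

```python
import math

# Assumed input values (rounded measurements, natural units, energies in GeV)
ALPHA = 1.0 / 137.036     # fine-structure constant at low energy
M_W = 80.4                # W boson mass
M_Z = 91.19               # Z boson mass
GAMMA_Z = 2.495           # Z boson width


def weinberg_angle_degrees():
    """Weak mixing angle from cos(theta_W) = M_W / M_Z."""
    return math.degrees(math.acos(M_W / M_Z))


def fermi_constant():
    """Lowest-order G_F in GeV^-2 from G_F / sqrt(2) = g^2 / (8 M_W^2)."""
    sin2_theta_w = 1.0 - (M_W / M_Z) ** 2
    e_squared = 4.0 * math.pi * ALPHA
    g_squared = e_squared / sin2_theta_w      # from e = g sin(theta_W)
    return math.sqrt(2.0) * g_squared / (8.0 * M_W ** 2)


def propagator_suppression(q_squared, mass=M_Z, width=GAMMA_Z):
    """Magnitude of the massive propagator relative to the photon one, |q^2 / (q^2 - M^2 + i M Gamma)|."""
    return abs(q_squared / complex(q_squared - mass ** 2, mass * width))


if __name__ == "__main__":
    print("theta_W [deg]      :", weinberg_angle_degrees())
    print("G_F [GeV^-2]       :", fermi_constant())
    print("suppression, 1 GeV :", propagator_suppression(1.0))
```

With these inputs the mixing angle comes out close to 28°, the Fermi constant within a few percent of its measured value of about $1.166 \times 10^{-5}$ GeV$^{-2}$, and the Z propagator at a momentum transfer of 1 GeV is suppressed by roughly four orders of magnitude with respect to the photon propagator.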


Focus Box 9.10: SU(2) gauge symmetry and weak isospin 




via Z exchange, since the neutrino has no electric charge and cannot emit a 
photon. It has led to the discovery of neutral weak interactions. 

The electroweak gauge theory of Glashow, Weinberg and Salam thus con- 
verted Fermi’s effective theory of weak interactions into a renormalisable field 
theory. It predicted that parallel to electromagnetic interactions, there ought 
to be a neutral weak interaction which alters neither fermion charge Q, nor 
T3. It is called a weak neutral current, to distinguish it from the charged cur- 
rent interaction of neutron or muon decay. Excellent candidate experiments 
to check this prediction are neutrino interactions. Neutral currents were dis- 
covered using the heavy liquid bubble chamber Gargamelle in the intense 
neutrino beam at the CERN PS (see Section 8.3). Since neutrinos interact 
only weakly, the scattering of neutrinos off atomic electrons is essentially free 
of background. Ever since Simon Van der Meer and his collaborators had 
invented focussing by a magnetic horn, there was a muon-neutrino beam of 
sufficient intensity. Gargamelle with its heavy freon filling provided enough 
target mass to search for these rare processes. It was thus only a question 
of time and patience to prove or disprove electroweak theory by observing 
(or not!) the elastic scattering process $\nu_\mu e^- \to \nu_\mu e^-$ on a bubble chamber
picture¹². I remember that in the room at RWTH Aachen, where the scanning
team analysed Gargamelle pictures, there was a check list, counting the
weeks left until electroweak theory was proven wrong. Instead in 1973, the 
event reproduced in Figure 9.7 was found [411, 428]. The final state elec- 
tron is clearly identified by the minimum ionisation and logarithmic curling of 
its track; it also emits bremsstrahlung which converts into electron-positron 
pairs in the heavy liquid. Shortly afterwards, neutral weak interactions
of the neutrino with quarks were also demonstrated [410, 421]. This hadronic part
of weak interactions, however, was experimentally controversial, since there
is a background process, neutron scattering off nuclei, which is unavoidably
present and must be carefully controlled. This was first done successfully by
Dieter Haidt using reactions where neutron source and interaction are both
visible on the same bubble chamber picture. He received the physics prize of 
the German Physical Society of 1975 for his method. The Gargamelle collab- 
oration received the High Energy and Particle Physics Prize of the European 
Physical Society in 2009 for their discovery. 

The history of the discovery of hadronic weak neutral currents
is viewed quite differently on the two sides of the Atlantic. While U.S. authors like
David B. Cline [471, 493] emphasise that the publications from the Gargamelle
experiment at CERN and the concurrent HPWF experiment at Fermilab [415, 
416, 417] were almost simultaneous, European authors like Haidt and Donald 
Perkins [473, 497] take pride in claiming precedence for neutral weak interac- 
tions with both leptons and quarks. In any case, the discovery was promptly 


¹² To filter such rare events out of the many thousand (mostly empty!) bubble
chamber pictures is an impressive achievement. The RWTH Aachen technical staff at the time
consisted of: Sylvia Jakobs, Heidi Klamann, Christa Malms, Karin Pohlmann, Marlene 
Stommel, Hannelore von Hoegen and Gisela Weerts. 
Figure 9.7 Bubble chamber picture of the first observed elastic muon neutrino
scattering off an electron ($\nu_\mu e^- \to \nu_\mu e^-$). The neutrino enters from the left
and hits an atomic electron projecting it to the right. The electron is seen as a 
minimum ionising particle. It emits a bremsstrahlung photon which converts 
into an electron-positron pair. The energy loss in the heavy liquid makes the 
electron then curl up logarithmically. 


confirmed by two independent experiments. However, the conflict may have 
influenced the decision of the Nobel committee to award the 1979 prize in 
physics to the three theorists Glashow, Weinberg and Salam and ignore exper- 
imentalists [436]. 

The discovery of Z and W bosons as real particles required the construc- 
tion of a new collider for protons and antiprotons at CERN. For this pur- 
pose, the Super-Proton-Synchrotron was converted into a proton-antiproton 
collider. The revolutionary method of stochastic cooling, again invented by 
Simon Van der Meer [437], made it possible to obtain beams of smaller size and spread
in energy. The experiments UA1 [444, 445] and UA2 [451, 447] reported the 
production and leptonic decay of both weak bosons in 1983. Van der Meer and 
Carlo Rubbia obtained the Nobel prize in 1984 for the concept and construc- 
tion of the collider. To further investigate the properties of the weak bosons, 
CERN decided even before their discovery to construct the Large Electron- 
Positron collider LEP with four experiments. From 1989 to 2000, these experiments
investigated weak interactions with unprecedented precision and
found the predictions of electroweak theory brilliantly confirmed. In parallel, 
the SLAC Linear Collider SLC, providing data with polarised electron and 
positron beams, and the Tevatron storage ring at Fermilab, providing measure- 
ments of the mass of W boson and t quark, have contributed to these precision 
measurements. Since there was no controversy, encouraged by CERN, SLAC 
and Fermilab management, the experimental collaborations got together to 
combine their results [561, 599]. As an example, Figure 9.8 shows the total
cross section for the reaction $e^+e^- \to \mu^+\mu^-$ collected by several generations
of electron-positron colliders [506], including the LEP experiment L3, together
with the prediction of electroweak theory. The momentum transfer relevant
here is the total centre-of-mass energy, $\sqrt{s}$ (see the left graph of Figure 9.6).
The electromagnetic process dominates at small momentum transfer, with its 
propagator term 1/s. At momentum transfers in the vicinity of the Z mass, 
neutral weak interactions dominate and lead to a gigantic peak in the cross 
section. Resonance phenomena like this are observed in all branches of physics, 
from acoustics and electrodynamics all the way to particle physics. They are 
thus a wonderful example for the unity of physics and nature in general. 
In Focus Boxes 9.11 and 9.12 we draw a parallel between the Z resonance 
and a damped mechanical oscillator. On the flanks of the resonance, there is 
notable interference between Z and photon interactions, destructive below and 
constructive above the resonance peak, which also has a mechanical analog. 
Interference shows up in particular in the asymmetry between scattering into 
the forward and the backward hemisphere, which is also shown in Figure 9.8.
Experiments thus not only demonstrate the workings of the Z boson as the 
field quantum of neutral weak interactions, but also the validity of the super- 
position principle in quantum physics far below the atomic scale, and the wave 
nature of particle interactions. 
Figure 9.8 Cross section (left) and forward-backward asymmetry (right) for
the reaction $e^+e^- \to \mu^+\mu^-$ measured at LEP and at lower energies [506]. The
asymmetry and the overlaid theory curves are explained in Focus Box 9.11. 


With these measurements, the gauge theory of electroweak interactions 
is confirmed down to per mille level accuracy, if not better. But there is an 
inherent contradiction. Local gauge invariance can only be ensured for mass- 
less gauge bosons like the photon. Mass terms for gauge or matter particles 
break gauge symmetry. The Z and W bosons are not only massive, but really 
heavy; the top quark weighs as much as a hafnium nucleus. So why is electroweak
theory still renormalisable? Is gauge theory a wrong track or are all
The cross section for $e^+e^- \to \mu^+\mu^-$ according to electroweak theory has
three terms, $\sigma_{tot}(e^+e^- \to \mu^+\mu^-) = \sigma_\gamma + \sigma_Z + \sigma_{int}$. The first term contains
the square of the amplitude for photon exchange:

$$\frac{d\sigma_\gamma}{d\Omega} = \frac{\alpha^2}{4s}\, (1 + \cos^2\theta)\,; \qquad \sigma_\gamma = \frac{4\pi\alpha^2}{3s}$$

The second comes from Z exchange, the third describes the interference
between the two. The angular distribution taking into account all three terms
is:

$$\frac{d\sigma}{d\Omega} = \frac{\alpha^2}{4s} \left[ A_0\, (1 + \cos^2\theta) + A_1 \cos\theta \right]$$

with the scattering angle $\theta = \angle(e^-, \mu^-)$. The first term is symmetric under
$\cos\theta \to -\cos\theta$, the second is antisymmetric. Their coefficients are
$A_0 = 1 + 2\,\mathrm{Re}(r)\, g_V^2 + |r|^2 (g_V^2 + g_A^2)^2$ and $A_1 = 4\,\mathrm{Re}(r)\, g_A^2 + 8 |r|^2 g_V^2 g_A^2$. The pole term
$r$ contains the Breit-Wigner function, characteristic for the decay of heavy
particles:

$$r = \frac{\sqrt{2}\, G_F}{e^2}\, \frac{s\, M_Z^2}{s - M_Z^2 + i M_Z \Gamma_Z}$$

The couplings $g_V$ and $g_A$ come from the electroweak charges, $g_V = T_3 -
2Q \sin^2\theta_W$ and $g_A = T_3$, with the third component of weak isospin $T_3$ and
the electric charge $Q$. The index of these constants indicates that they refer
to the vector and axialvector components of the weak neutral current. Pure
electromagnetic interactions correspond to $A_0 = 1$ and $A_1 = 0$. The purely
weak terms are proportional to $|r|^2 \propto G_F^2/\alpha^2$, the interference terms to the real
part $\mathrm{Re}(r) \propto G_F/\alpha$. Major observables are the total cross section $\sigma$, where the
antisymmetric term cancels, and the forward-backward asymmetry $A_{FB}$, which
isolates the interference effect:

$$\sigma = \frac{4\pi\alpha^2}{3s}\, A_0\,; \qquad A_{FB} = \frac{\sigma_F - \sigma_B}{\sigma_F + \sigma_B} = \frac{3 A_1}{8 A_0}$$

with the partial cross sections $\sigma_F$ for forward ($\cos\theta > 0$) and $\sigma_B$ for
backward ($\cos\theta < 0$) scattering. The asymmetry is negative for energies below
the resonance, goes through zero at $s = M_Z^2$ and becomes positive above.
Experimental data for the cross section and the asymmetry as a function of
the centre-of-mass energy are shown in Figure 9.8.


Focus Box 9.11: The Z resonance in ete~ — ppt 
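
The formulas of Focus Box 9.11 are easy to evaluate numerically. The following is a minimal sketch, not a precision calculation: it uses $A_0 = 1 + 2\,\mathrm{Re}(r)g_V^2 + |r|^2(g_V^2+g_A^2)^2$, $A_1 = 4\,\mathrm{Re}(r)g_A^2 + 8|r|^2 g_V^2 g_A^2$ and $A_{FB} = 3A_1/(8A_0)$. The numerical inputs (low-energy $\alpha$, $G_F$, $M_Z$, $\Gamma_Z$, $\sin^2\theta_W$) and all function names are assumptions chosen for illustration; radiative corrections and the running of $\alpha$ are ignored.

```python
import math

# Illustrative input values (assumptions, not taken from the text)
ALPHA = 1 / 137.036      # fine structure constant at low energy
G_F = 1.1663787e-5       # Fermi constant in GeV^-2
M_Z = 91.1876            # Z mass in GeV
GAMMA_Z = 2.4952         # Z total width in GeV
SIN2_THETA_W = 0.2312    # sin^2 of the weak mixing angle

# Couplings of a charged lepton: T3 = -1/2, Q = -1
T3, Q = -0.5, -1.0
G_V = T3 - 2.0 * Q * SIN2_THETA_W
G_A = T3


def pole_term(sqrt_s):
    """Breit-Wigner pole term r(s), with e^2 = 4 pi alpha."""
    s = sqrt_s ** 2
    e2 = 4.0 * math.pi * ALPHA
    denominator = complex(s - M_Z ** 2, M_Z * GAMMA_Z)
    return math.sqrt(2.0) * G_F / e2 * s * M_Z ** 2 / denominator


def coefficients(sqrt_s):
    """Symmetric (A0) and antisymmetric (A1) angular coefficients."""
    r = pole_term(sqrt_s)
    a0 = 1.0 + 2.0 * r.real * G_V ** 2 + abs(r) ** 2 * (G_V ** 2 + G_A ** 2) ** 2
    a1 = 4.0 * r.real * G_A ** 2 + 8.0 * abs(r) ** 2 * G_V ** 2 * G_A ** 2
    return a0, a1


def asymmetry(sqrt_s):
    """Forward-backward asymmetry A_FB = 3 A1 / (8 A0)."""
    a0, a1 = coefficients(sqrt_s)
    return 3.0 * a1 / (8.0 * a0)


def cross_section_nb(sqrt_s):
    """Total cross section 4 pi alpha^2 A0 / (3 s), converted to nanobarn."""
    a0, _ = coefficients(sqrt_s)
    gev2_to_nb = 0.3894e6  # (hbar c)^2 in nb GeV^2
    return 4.0 * math.pi * ALPHA ** 2 / (3.0 * sqrt_s ** 2) * a0 * gev2_to_nb


if __name__ == "__main__":
    for energy in (34.0, 80.0, M_Z, 100.0, 120.0):
        print(f"sqrt(s) = {energy:7.2f} GeV  "
              f"sigma = {cross_section_nb(energy):8.4f} nb  "
              f"A_FB = {asymmetry(energy):+.3f}")
```

Scanning the energy this way shows the behaviour described in the box: a negative asymmetry well below the resonance, a value close to zero at the pole and a positive asymmetry above, while the cross section peaks at $\sqrt{s} = M_Z$.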
A simple one-dimensional mechanical oscillator system is shown in the figure
below on the left. A periodic force of amplitude $F_0$ and angular frequency $\omega$
excites a harmonic oscillator characterised by the spring constant $k$ and the
mass $m$. A damping device is seen on the bottom, with a friction coefficient
$b$. The amplitude of the movement follows from Newton’s equation:

$$-kx - b\,\frac{dx}{dt} + F_0\cos\omega t = m\,\frac{d^2x}{dt^2}$$

The solution is:

$$x = \frac{F_0\,\sin(\omega t - \phi)}{\sqrt{m^2(\omega^2 - \omega_0^2)^2 + b^2\omega^2}}$$

Here $\phi$ is the phase shift between the exciting force and the oscillation. The
eigenfrequency $\omega_0 = \sqrt{k/m}$ depends on the ratio of spring constant and mass.
The two figures on the right show how the maximum elongation $x_{max}$ and the
phase shift $\phi$ depend on the ratio between exciting frequency and eigenfrequency,
$\omega/\omega_0$. Comparison with the equations ruling the Z resonance and
Figure 9.8 reveals the close analogy between the two processes. The resonance
term has the same form. The cross section corresponds to the square of the
mechanical amplitude, the sine of the phase shift to the angular asymmetry.
The centre-of-mass energy corresponds to the exciting frequency, $k$ to the coupling
constant and $b$ to the resonance width. All in all a nice example of the
unity of physical laws in very different branches.

Focus Box 9.12: The damped mechanical oscillator
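
The steady-state solution of Focus Box 9.12 can be checked numerically. The sketch below evaluates the maximum elongation and the phase shift, and verifies that $x(t) = x_{max}\sin(\omega t - \phi)$ satisfies Newton’s equation. The relation $\tan\phi = m(\omega^2-\omega_0^2)/(b\omega)$ is not quoted in the box; it follows from inserting the solution into the equation of motion and is stated here as a derived assumption. Function names and parameter values are purely illustrative.

```python
import math


def steady_state(omega, k, m, b, f0):
    """Maximum elongation and phase shift of the driven, damped oscillator."""
    omega0 = math.sqrt(k / m)                  # eigenfrequency
    detuning = m * (omega ** 2 - omega0 ** 2)
    x_max = f0 / math.hypot(detuning, b * omega)
    phi = math.atan2(detuning, b * omega)      # zero exactly at resonance
    return x_max, phi


def residual(t, omega, k, m, b, f0):
    """Left side minus right side of Newton's equation for the solution."""
    x_max, phi = steady_state(omega, k, m, b, f0)
    x = x_max * math.sin(omega * t - phi)
    v = x_max * omega * math.cos(omega * t - phi)
    a = -x_max * omega ** 2 * math.sin(omega * t - phi)
    return -k * x - b * v + f0 * math.cos(omega * t) - m * a


if __name__ == "__main__":
    k, m, b, f0 = 4.0, 1.0, 0.5, 1.0           # eigenfrequency omega0 = 2
    for ratio in (0.5, 0.9, 1.0, 1.1, 2.0):
        x_max, phi = steady_state(ratio * 2.0, k, m, b, f0)
        print(f"omega/omega0 = {ratio:.1f}  x_max = {x_max:.3f}  sin(phi) = {math.sin(phi):+.3f}")
```

With this sign convention, $\sin\phi$ is negative below the eigenfrequency, zero at resonance and positive above, which is the behaviour the box compares to the forward-backward asymmetry.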
particles really massless? We come back to the resolution of this mystery in
Section 9.3. 

Weak interactions break many symmetries which the other interactions 
preserve: 


• Charged weak interactions change the flavour quantum number. The
W does not interact with the quarks and leptons of Figure 9.4 but 
with mixtures specified in Focus Boxes 9.13 and 9.14. These are gen- 
erated by rotations in flavour space, described by two unitary matrices, 
named after Nicola Cabibbo [356], Makoto Kobayashi and Toshihide 
Maskawa [413] for quarks, Bruno Pontecorvo [340], Ziro Maki, Masami 
Nakagawa and Shoichi Sakata [367] for leptons. These matrices rotate 
the eigenstates of the mass operator of Figure 9.4 into the mixtures 
which interact with the W bosons. Both are dominated by their diago- 
nal elements, but have non-negligible entries off the diagonal. The lat- 
ter lead to interactions among members of different generations, such 
as $s \to u W^-$ or $\tau \to \nu_\mu W^-$. Even rarer are processes which jump
two generations. Mixtures are generated by weak interactions, but what 
propagates through space-time are the “real” particles, and with differ- 
ent velocities since they do not have the same mass. When their flavour 
is probed again, the mixture thus may have changed and flavour oscil- 
lations occur. 


• Weak interactions also do not respect parity, but violate its symmetry
in a maximum way, as we have seen in Focus Box 9.4. They only interact 
with the left-handed part of matter fields. 


• Moreover, they also violate the symmetry between matter and antimatter,
the so-called CP symmetry, but only a little bit this time¹³. This
violation manifests itself in the decays of heavy quarks, in that parti- 
cles and antiparticles have slightly different decay properties. The elec- 
troweak theory describes this phenomenon by a complex phase in the 
mixing matrix (see Focus Box 9.13). For quarks it has been measured, 
not yet for leptons. 


In contrast to the photon as a carrier of electromagnetic interactions, W 
and Z carry themselves the charges responsible for electroweak interactions. 
The neutral boson has $Q = T_3 = 0$, but $T = 1$. The charged bosons have
$Q = T_3 = \pm 1$. For this reason, the electroweak bosons interact with themselves
as the Feynman graphs of Figure 9.9 show. At the same time, these interactions 
fix the electroweak couplings of matter particles, and make them identical 
for all generations. The underlying reason for boson self-coupling is the non- 
Abelian nature of the group SU(2), i.e. the fact that rotations around two 
axes do not commute. 
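
That the generators of SU(2) do not commute can be verified in a few lines. The sketch below is only an illustration under stated assumptions: it represents the generators by the Pauli matrices (a standard choice, not spelled out in the text) and checks the commutator $[\sigma_1, \sigma_2] = 2i\sigma_3$; all helper names are invented for this example.

```python
# Pauli matrices as nested tuples; sigma_k / 2 generate SU(2) rotations
SIGMA_1 = ((0, 1), (1, 0))
SIGMA_2 = ((0, -1j), (1j, 0))
SIGMA_3 = ((1, 0), (0, -1))


def matmul(a, b):
    """Product of two 2x2 matrices."""
    return tuple(
        tuple(sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2))
        for i in range(2)
    )


def commutator(a, b):
    """[a, b] = ab - ba; vanishes only if the two matrices commute."""
    ab, ba = matmul(a, b), matmul(b, a)
    return tuple(tuple(ab[i][j] - ba[i][j] for j in range(2)) for i in range(2))


def scale(factor, a):
    """Multiply every matrix element by a number."""
    return tuple(tuple(factor * a[i][j] for j in range(2)) for i in range(2))


if __name__ == "__main__":
    print(commutator(SIGMA_1, SIGMA_2))   # non-zero: the group is non-Abelian
    print(scale(2j, SIGMA_3))             # the same matrix, 2i sigma_3
```

By contrast, the U(1) phase factors of electromagnetism are ordinary complex numbers, which always commute; this is why the photon carries no charge of its own.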

This property of boson self-coupling is shared by the third force of the 
Standard Model, the strong force that binds quarks together into hadrons. 


¹³ See http://pdg.lbl.gov/2019/reviews/rpp2019-rev-cp-violation.pdf
Weak transitions between quarks (and leptons) happen predominantly but
not exclusively among members of the same generation. Charged pion decay
$\pi^+ \to \mu^+\nu_\mu$ is an example of the dominant type. Kaon decay $K^+ \to \mu^+\nu_\mu$ is
a process involving quarks of the first and second generation. Otherwise they
are very similar:

[Feynman graphs: $u\bar{d} \to W^+ \to \mu^+\nu_\mu$ for the pion, $u\bar{s} \to W^+ \to \mu^+\nu_\mu$ for the kaon]

Due to the much larger phase space for the kaon decay, one would expect
a ratio of the partial widths $\Gamma(K^+ \to \mu^+\nu_\mu)/\Gamma(\pi^+ \to \mu^+\nu_\mu) = O(100)$. The
observed ratio is of order 1 instead. One concludes that cross-generation couplings
are very much disfavoured compared to intra-generation couplings, by
at least a factor 10 in the amplitude. In 1963, Nicola Cabibbo proposed a
way to describe this fact [356]. States which interact with the $W$ are not the
quarks which are listed in Figure 9.4, but mixtures, $d' = d\cos\theta_C + s\sin\theta_C$
and $s' = -d\sin\theta_C + s\cos\theta_C$, using the Cabibbo angle $\theta_C$. The states $d'$, $s'$
and $b'$ are not eigenstates of the mass operator. Weak interactions mix a little
bit of a strange quark into the wave function of a down quark and vice versa.
Consequently, a factor of $\cos\theta_C$ is included in the intra-generation amplitude.
For inter-generation transitions, a factor of $\sin\theta_C$ appears. In the ratio of
decay widths, we get a factor of $\tan^2\theta_C$, from the measured rates $\theta_C \simeq 10^\circ$.
Kobayashi and Maskawa enlarged the mechanism to three generations [413],
introducing the Cabibbo-Kobayashi-Maskawa matrix:

$$\begin{pmatrix} d' \\ s' \\ b' \end{pmatrix} = \begin{pmatrix} V_{ud} & V_{us} & V_{ub} \\ V_{cd} & V_{cs} & V_{cb} \\ V_{td} & V_{ts} & V_{tb} \end{pmatrix} \begin{pmatrix} d \\ s \\ b \end{pmatrix}$$

There is a hierarchy of elements: intra-generation elements dominate, those
which jump a single generation are still appreciable, those which jump two
are small. Methodology of the measurement and current values are found in
http://pdg.lbl.gov/2019/reviews/rpp2019-rev-ckm-matrix.pdf. The
mixing mechanism leads to oscillations between particle and antiparticle
for neutral hadrons, like $K^0 \leftrightarrow \bar{K}^0$, $D^0 \leftrightarrow \bar{D}^0$ and $B^0 \leftrightarrow \bar{B}^0$. Note that the
matrix elements can be complex. There can thus be a complex phase which
cannot be eliminated by redefining the flavour basis. Such a phase violates
the symmetry between matter and antimatter in the amplitudes where it
appears; it is the way in which the Standard Model describes CP violation.


Focus Box 9.13: Weakly interacting quark states
In the leptonic sector, weak interactions also introduce a mixing mechanism.
The weak bosons do not interact with the mass eigenstates of Figure 9.4, but
with superpositions:

$$\begin{pmatrix} \nu_e \\ \nu_\mu \\ \nu_\tau \end{pmatrix} = \begin{pmatrix} U_{eL} & U_{eM} & U_{eH} \\ U_{\mu L} & U_{\mu M} & U_{\mu H} \\ U_{\tau L} & U_{\tau M} & U_{\tau H} \end{pmatrix} \begin{pmatrix} \nu_L \\ \nu_M \\ \nu_H \end{pmatrix}$$

The matrix is called Pontecorvo-Maki-Nakagawa-Sakata matrix after its
inventors. For an up-to-date review of the related methodologies, please see
http://pdg.lbl.gov/2019/reviews/rpp2019-rev-neutrino-mixing.pdf.
The superposition leads to oscillations between weak neutrino flavours if the
masses of $\nu_L$, $\nu_M$ and $\nu_H$ are not the same. When a pure $\nu_\mu$ beam is produced,
e.g. by weak pion or kaon decay, different mass eigenstates propagate with
different velocity. When the beam composition is then probed, again by weak
interactions, at a distance from the production point, the beam composition
will have changed. Thus one observes e.g. $\nu_e$ or $\nu_\tau$ interactions in a $\nu_\mu$ beam.


Focus Box 9.14: Weakly interacting lepton states
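
The oscillation mechanism of Focus Box 9.14 can be made quantitative in the simplest case of only two flavours. The sketch below uses the standard two-flavour approximation, $P = \sin^2 2\theta\,\sin^2(1.267\,\Delta m^2 L/E)$ with $\Delta m^2$ in eV², $L$ in km and $E$ in GeV. This formula is not given in the text and is introduced here as an assumption; the mixing angle and mass difference in the usage example are illustrative numbers, not measured values quoted by the book.

```python
import math


def oscillation_probability(length_km, energy_gev, delta_m2_ev2, theta):
    """Two-flavour appearance probability after a flight path length_km.

    Assumes the standard approximation with a single mixing angle theta
    (in radians) and a single squared-mass difference delta_m2_ev2.
    """
    phase = 1.267 * delta_m2_ev2 * length_km / energy_gev
    return math.sin(2.0 * theta) ** 2 * math.sin(phase) ** 2


def first_maximum_km(energy_gev, delta_m2_ev2):
    """Distance at which the oscillation phase first reaches pi/2."""
    return math.pi / 2.0 * energy_gev / (1.267 * delta_m2_ev2)


if __name__ == "__main__":
    theta = math.radians(45.0)    # illustrative: maximal mixing
    delta_m2 = 2.5e-3             # illustrative value in eV^2
    energy = 1.0                  # GeV
    for length in (0.0, 100.0, 300.0, first_maximum_km(energy, delta_m2)):
        p = oscillation_probability(length, energy, delta_m2, theta)
        print(f"L = {length:7.1f} km  P = {p:.3f}")
```

The probability vanishes at the production point and when the masses are equal ($\Delta m^2 = 0$), in line with the statement that oscillations require different masses for the mass eigenstates.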
Figure 9.9 Feynman graphs for the self coupling of electroweak bosons.


Its unknown dynamics had kept Heisenberg from accepting the quark model. 
It was already clear from spectroscopy that quarks needed an additional new 
quantum number. A state made of three identical quarks without relative 
angular momentum and with total spin 3/2, like $\Delta^{++}$, is not possible without
an additional quantum number which makes the total wave function antisymmetric,
as the Pauli principle requires.

The gauge theory of strong interactions, quantum chromodynamics (QCD),
takes this into account. It is based on the symmetry group SU(3) of rotations
around three axes, again a non-Abelian group where transformations do not
commute, as we explain in Focus Box 9.15. QCD has many fathers; a short
account is found in [590], a more detailed one in [609]; an early reference
is [412]. An interesting introduction very much centred on aesthetic arguments 
has been given by Frank Wilczek [629]. The global version of SU(3) symmetry 
causes the conservation of a three-component strong charge, called colour. 
One chooses the basic colours red (R), green (G) and blue (B) like in colour 
TV. A red quark, $q_R$, has the red colour quantum number equal to 1, the
two others equal to zero, $(1, 0, 0)$. An antiquark of the same kind carries an
anticolour: $\bar{q}_R$ has a colour of $(-1, 0, 0)$. Electroweak interactions are colour
blind, leptons and electroweak bosons do not carry a colour charge. We go into 
technical details in Focus Box 9.15. Bound states of coloured quarks have no 
net colour, in a symmetric way for mesons, an antisymmetric way for baryons. 
We can symbolically decompose their wave functions as follows: 


$$(q\bar{q}) = (q_R\bar{q}_R + q_G\bar{q}_G + q_B\bar{q}_B)$$
$$(qqq) = (q_R q_G q_B - q_G q_R q_B + q_B q_R q_G - q_R q_B q_G + q_G q_B q_R - q_B q_G q_R)$$
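
The sign pattern of the baryon colour wave function is that of the permutations of (R, G, B): even permutations enter with a plus sign, odd ones with a minus sign. The short sketch below, with helper names invented for this illustration, builds the six terms and checks that exchanging the colours of any two quarks flips the overall sign, as antisymmetry requires.

```python
from itertools import permutations

COLOURS = "RGB"


def parity(perm):
    """Sign of a permutation of (0, 1, 2), from the number of inversions."""
    inversions = sum(
        1
        for i in range(len(perm))
        for j in range(i + 1, len(perm))
        if perm[i] > perm[j]
    )
    return -1 if inversions % 2 else 1


def baryon_colour_state():
    """Map each colour assignment of the three quarks to its sign."""
    return {
        tuple(COLOURS[i] for i in perm): parity(perm)
        for perm in permutations(range(3))
    }


def exchange(state, a, b):
    """Exchange the colours of quarks number a and b in every term."""
    exchanged = {}
    for colours, sign in state.items():
        swapped = list(colours)
        swapped[a], swapped[b] = swapped[b], swapped[a]
        exchanged[tuple(swapped)] = sign
    return exchanged


if __name__ == "__main__":
    for colours, sign in baryon_colour_state().items():
        print("+" if sign > 0 else "-", "".join(colours))
```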


The field quanta of strong interactions are called gluons since their role is to 
bind quarks into hadrons. Like the electroweak bosons, they carry the required 
charge to interact among themselves. In fact they carry a colour and an
anticolour; a red-antigreen gluon thus has colour charge $(1, -1, 0)$. When a
quark emits or absorbs a gluon, its own colour charge changes as sketched in 
Figure 9.11. The total colour of the hadron always stays zero in this process. 
In particular proton and neutron thus have no net colour and cannot as a 
whole emit or absorb gluons. The nuclear force that binds them into nuclei 
is just a residual of the strong force, analogous to the tiny Van der Waals 
force that acts between electrically neutral atoms and molecules in gases and 
liquids. More information about properties of the nuclear force is found in 
Section 5.5. 


Figure 9.10 Elementary vertices of QCD. The quark-gluon interaction is
shown on the left, the couplings among three and four gluons on the right.
The indices give examples of the colour charges involved.


Figure 9.11 A moment in the life of the positive pion. Its main ingredients are an
up quark $u$ and an anti-down quark $\bar{d}$. They bind by constantly exchanging gluons,
their colour charge changes but the net colour of the hadron always stays zero.


Compared to electroweak couplings, the strong coupling constant $g$ is
rather large. It enters the scattering amplitude as $\alpha_s = g^2/(4\pi)$. This is the
analog of the fine structure constant of QED, but at least one order of magnitude
larger than its electromagnetic counterpart. Strong interactions thus
The phase transformations of the group SU(3):

$$\psi_q(x) \to \psi'_q(x) = e^{i\alpha_k\lambda_k/2}\,\psi_q(x)$$

are generated by a linear combination of the eight Gell-Mann matrices
$\lambda_k$, $k = 1 \ldots 8$. The Dirac equation is invariant under global transformations
with constant coefficients $\alpha_k \neq \alpha_k(x)$. According to Noether’s theorem, there
is a corresponding charge with three components, which are individually conserved.
It is called colour; its basis can be chosen as red, green and blue:

$$R = \begin{pmatrix} 1 \\ 0 \\ 0 \end{pmatrix} \quad ; \quad G = \begin{pmatrix} 0 \\ 1 \\ 0 \end{pmatrix} \quad ; \quad B = \begin{pmatrix} 0 \\ 0 \\ 1 \end{pmatrix}$$

A quark carries a single colour, the up-quark has thus three states, $u_R$, $u_G$ and
$u_B$. Antiquarks carry an anticolour, like $\bar{u}_R$, $\bar{u}_G$ and $\bar{u}_B$. Interactions between
quarks involve a colour change by emitting or absorbing one of eight gluons.
Gluons carry one colour and one anticolour each, with e.g. the combinations:

$$R\bar{B},\; R\bar{G},\; B\bar{G},\; G\bar{R},\; G\bar{B},\; B\bar{R},\; \frac{1}{\sqrt{2}}(R\bar{R} - B\bar{B}),\; \frac{1}{\sqrt{6}}(R\bar{R} + B\bar{B} - 2G\bar{G})$$

With three colours and anticolours one could have thought that there were
nine gluons; however, the symmetric combination $(R\bar{R} + B\bar{B} + G\bar{G})/\sqrt{3}$ has
no net colour and cannot interact. The remaining eight are spin-one bosons,
neutral with respect to electroweak interactions and massless. If one imposes
invariance under local SU(3) phase transformations, by a local angle $\alpha_k(x)$,
the following Lagrange density results:

$$\mathcal{L} = \bar{\psi}_q\,(i\gamma^\mu\partial_\mu - m)\,\psi_q - g\,\Big(\bar{\psi}_q\gamma^\mu\frac{\lambda_k}{2}\psi_q\Big)\,G^k_\mu - \frac{1}{4}\,G^k_{\mu\nu}G_k^{\mu\nu}$$

The first term describes the free quark field, the second its interaction with
the eight gluon fields $G^k_\mu$, as shown on the left of Figure 9.10. The last term
involves the eight gluon field tensors $G^k_{\mu\nu}$:

$$G^k_{\mu\nu} = (\partial_\mu G^k_\nu - \partial_\nu G^k_\mu) - g\,f_{kjl}\,G^j_\mu G^l_\nu$$

with the SU(3) structure constants $f_{kjl}$. The terms in brackets remind us of
the electromagnetic analog $F_{\mu\nu}$, they describe the motion of the gluon field.
The third term, quadratic in the gluon fields, leads to the self-interactions
among gluons, shown on the right of Figure 9.10:

$$g\,(\partial_\mu G_\nu)(G^\mu G^\nu) \sim G^3 \quad ; \quad g^2\,(G_\mu G_\nu)(G^\mu G^\nu) \sim G^4$$

They have important consequences for the large distance behaviour of strong
interactions.


Focus Box 9.15: SU(3) symmetry and the strong force
really deserve their name. However, their range is extremely limited despite
the fact that their force quanta are massless. This is due to the fact that the
strong coupling is anything but constant, but depends on momentum transfer
more strongly than all other couplings.



Figure 9.12 Vacuum polarisation in QED. The Feynman graph on the left 
shows the contribution of fermion loops to the photon propagator. It leads to 
fermion pairs of opposite charge close to the source as sketched on the right. 


The fact that couplings are not constant is due to higher order contributions
to the propagators of gauge bosons. When we measure the charge of a matter
particle, we of course do that experimentally. The measurement automatically
contains all orders of perturbation theory and is made at a given momentum
transfer. However, we insert this coupling constant into an amplitude of fixed
order and possibly at a different momentum transfer. This is taken into
account by letting the coupling constant “run”. Contributions to the photon
and gluon propagator are shown in Figures 9.12 and 9.13, respectively. In both
cases, the contributions of fermion loops, called vacuum polarisation, lead to
a screening of the electric and colour charge. The screening is the stronger,
the larger the distance between projectile and target, i.e. the smaller the
momentum transfer. The electric charge is thus a little bit smaller at large
distances, a little larger at small distances.

In contrast to that, the strong vacuum polarisation receives an additional 
contribution by gluon loops, and these strengthen the charge since they are 
made of bosons¹⁴. They are also more numerous than fermion loops, such that
the strong coupling grows rapidly with increasing distance. Colour charge is 
thus large at large distances, bound states cannot be separated. 
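
The competition between screening by quark loops and anti-screening by gluon loops can be illustrated with the one-loop expression for the running strong coupling, $\alpha_s(Q) = \alpha_s(\mu)/[1 + \alpha_s(\mu)\,b_0 \ln(Q^2/\mu^2)]$ with $b_0 = (33 - 2n_f)/(12\pi)$. This formula is not given in the text and is used here as an assumption: the term 33 stems from gluon loops, the term $-2n_f$ from the loops of $n_f$ quark flavours. The reference value $\alpha_s(M_Z) = 0.118$ and the fixed choice $n_f = 5$ are illustrative simplifications.

```python
import math

ALPHA_S_REF = 0.118    # assumed reference value at the Z mass
Q_REF = 91.1876        # reference scale in GeV


def b0(n_flavours):
    """One-loop coefficient: +33 from gluon loops, -2 per quark flavour."""
    return (33.0 - 2.0 * n_flavours) / (12.0 * math.pi)


def alpha_s(q_gev, n_flavours=5):
    """One-loop running coupling at momentum transfer q_gev.

    The flavour number is kept fixed, so quark thresholds are ignored.
    """
    log_term = math.log(q_gev ** 2 / Q_REF ** 2)
    return ALPHA_S_REF / (1.0 + ALPHA_S_REF * b0(n_flavours) * log_term)


if __name__ == "__main__":
    for q in (2.0, 10.0, Q_REF, 1000.0):
        print(f"Q = {q:8.2f} GeV  alpha_s = {alpha_s(q):.4f}")
```

Because the gluon term outweighs the quark term for any number of flavours up to 16, $b_0$ is positive: the coupling falls with increasing momentum transfer and grows towards small momentum transfer, i.e. large distances.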

Strong vacuum polarisation, on the one hand, leads to a steep decrease of 
the colour charge at small distance, i.e. high momentum transfer. The quarks 
bound inside hadrons then behave much like free particles; asymptotic freedom
is an intrinsic property of the theory. H. David Politzer, David J. Gross and 
Frank A. Wilczek shared the 2004 Nobel prize in physics for their discovery 


¹⁴ Due to the electroweak self-couplings, the photon and Z propagators also receive
strengthening contributions from weak boson loops. However, these do not play a significant
role because of the high mass of the weak bosons.
When bound quarks are struck by a gauge boson, or created as quark-antiquark
pairs in vacuo by a photon or Z, they move apart with high momentum.
As they do so, a colour field builds up between them, as an electric
field does when electric charges are pulled apart. However, while the electric field
becomes weaker with increasing distance, the colour field quickly becomes
much stronger.

When the distance between the quarks is about 1 fm = $10^{-15}$ m, the potential
energy stored in the field, which comes at the expense of the kinetic
energy of the quarks, is sufficient to form an additional quark-antiquark pair
out of the vacuum, as sketched on the right.

This process continues until the kinetic energy is exhausted. It creates a multitude
of hadrons moving along essentially in the direction of the initial quarks.
The result is a hadron jet, examples are shown in Figure 9.14. One can model
this process of jet formation, also called hadronisation or fragmentation, in a
less chromostatic and more dynamic way as follows.

One of the high-momentum quarks can emit a gluon. This gluon can split into
two according to gluon self-coupling, or into a quark-antiquark pair. In fact, the
two-gluon split is more likely, since there are more allowed colour combinations.
A cascade of quarks and gluons is thus created.
When one of the quark-antiquark pairs comes close enough in phase space
and the colours match, a hadronic bound state is formed. The end result is
the same as in the chromostatic model: the quark transforms itself into a jet
of hadrons around its initial direction.


Focus Box 9.16: Hadron jets
Figure 9.13 Vacuum polarisation in QCD. The Feynman graphs on the left
show the contributions of fermion and gluon loops to the gluon propagator.
The gluon loop contribution is larger and leads to an amplification of the
colour charge at large distances, i.e. small momentum transfer. This is experimentally
verified by the dependence of $\alpha_s$ on momentum transfer $Q$, shown
on the right [658].


of this feature of quantum chromodynamics. On the other hand, the coupling
increases rapidly at large distances, such that quarks are forever trapped inside
hadrons. At a distance of about a femtometer, the energy stored in the gluon
field between two quarks is so large that quark-antiquark pairs are spontaneously
created. When the quarks are torn further apart, this process repeats
until the kinetic energy of the coloured pair is exhausted. What remains are
mesons, colourless bound states of quarks and antiquarks, but no free colour
charges. This leads to the formation of hadron jets, which accompany the original
quark direction with a small opening angle. The mechanism is explained
further in Focus Box 9.16. It happens with probability one, so does not modify
elementary cross sections. Figure 9.14 shows examples of hadronic jets in high
energy reactions. Events with multiple jets, as shown on the right, come from
gluon bremsstrahlung, here in the reaction $e^+e^- \to q\bar{q}g$. The cross section
is quoted in Focus Box 9.17. Such events were first observed by experiments
at the PETRA collider of DESY, Hamburg. The gluon as a force carrier of
strong interactions was thus discovered. There were four experiments running
simultaneously when the collider reached an energy where the phenomenon
became detectable. There is no consensus on which one can claim precedence
in convincingly demonstrating the existence of gluons [481, 501, 607]. Once
again, the experimental proof thus did not receive recognition by a Nobel
prize, but the evidence for gluons is at the same level as that for quarks, and
so is that for asymptotic freedom and gluon self-coupling [568]. QCD thus exhibits
exactly the properties that electron-nucleon scattering experiments require
(see Section 9.1).

Calculations based on QCD are difficult at small momentum transfers, due 
to the large coupling strength which limits the application of perturbation 
theory. Instead one can choose to build models based on specific properties of 

Figure 9.14 Left: Reaction with two back-to-back hadronic jets registered by 
the ATLAS experiment at the LHC collider. (Credit: ATLAS © CERN) Right: 
Electron-positron annihilation into a hadronic final state with three jets, reg- 
istered by the L3 experiment at the LEP collider. (Credit: L3 © CERN) The 
tracks of charged particles are shown as curved lines in the inner detectors, 
energy deposits in the calorimeters as histograms. 


QCD, which describe the phenomenology of strong interactions rather well. An 
example is the formation of jets [433, 434]. Bound states cannot be described 
by perturbative methods at all. Implementations of QCD on a lattice, i.e. 
with a minimal distance between quarks, which is then shrunk towards zero, 
deliver numerical results for relative bound state masses with minimal input. 
The reproduction of the spectrum of light baryons and mesons [573] shows that 
QCD is the right theory for strong interactions also in the non-perturbative 
regime. Heisenberg’s requirement of a dynamic theory for quarks in hadrons 
is thus brilliantly fulfilled. Moreover, implementations of QCD on a lattice 
even reproduce successful models of the nuclear binding potential (see Focus 
Box 5.12), a residual force transmitted by colourless objects. 


9.3 SPACE-TIME 


Emmy Noether’s theorem dictates that continuous symmetries lead to the 
conservation of all elementary charges. The three interactions described by the 
Standard Model, electromagnetic, weak and strong, have observed properties 
in agreement with a field theory based on the requirement of local gauge 
invariance under the joint group U(1)×SU(2)×SU(3). Gauge invariance was
identified by ’t Hooft and Veltman as a necessary condition for the
renormalisability of a field theory. Without it, a theory makes no predictions which
can be confronted with precision experiments.
The reaction $e^+e^- \to$ hadrons comprises many elementary
processes: $e^+e^- \to q\bar{q}$, $q\bar{q}g$, $q\bar{q}gg$ etc. The
basic production process is an electroweak one, as
shown in the Feynman graph on the right. For the
electromagnetic process alone, the cross section for a
given quark flavour $i$ is analogous to the muon pair production quoted in
Focus Box 9.11, $\sigma(e^+e^- \to q_i\bar{q}_i) = 3Q_i^2\,(4\pi\alpha^2/3s)$, with the fractional electric
quark charge $Q_i$ and the centre-of-mass energy $\sqrt{s}$. The factor 3 comes from
the three possible colours of the quark, which the electromagnetic interaction
does not distinguish. Colour is an observable, although we do not know how
to measure it, and probably never will. Thus the cross sections add up, with
no interference. If one does not distinguish quark flavour (although one can
to some extent, at least for heavy flavours), one can also sum over all flavours
$i$, where the quark mass respects the threshold $\sqrt{s} > 2m_i$:

$$\sigma(e^+e^- \to \mathrm{hadrons}) = \frac{4\pi\alpha^2}{3s}\;3\sum_i Q_i^2$$

The colour and electric charge factor $3\sum_i Q_i^2$ makes a jump at every threshold:

$3[(\frac{2}{3})^2 + 2(\frac{1}{3})^2] = 2$          u, d, s             $\sqrt{s} < 2m_c \simeq 3.7$ GeV
$2 + 3(\frac{2}{3})^2 = \frac{10}{3}$                u, d, s, c          $3.7 < \sqrt{s} < 2m_b \simeq 10$ GeV
$\frac{10}{3} + 3(\frac{1}{3})^2 = \frac{11}{3}$     u, d, s, c, b       $10 < \sqrt{s} < 2m_t \simeq 350$ GeV
$\frac{11}{3} + 3(\frac{2}{3})^2 = 5$                u, d, s, c, b, t    $350 < \sqrt{s}$

Gluon bremsstrahlung in the final state is sketched in the Feynman
graphs on the left. The gluon emission happens after the elementary
pair creation by the photon,
as does hadronisation. So this process cannot be distinguished from the one
without gluon bremsstrahlung. There is thus an interference term of order
$\alpha\alpha_s$ between the two (in addition to the smaller $\alpha^2\alpha_s^2$ probability for the
process itself), adding to the cross section:

$$\sigma(e^+e^- \to \mathrm{hadrons}) = \frac{4\pi\alpha^2}{3s}\;3\sum_i Q_i^2\,\left(1 + \frac{\alpha_s}{\pi}\right)$$


Focus Box 9.17: Electron-positron annihilation into hadrons
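
The step structure of the colour and charge factor $3\sum_i Q_i^2$ in Focus Box 9.17 can be reproduced with exact fractions. In the sketch below the thresholds for c, b and t are the approximate values quoted in the box; setting the thresholds of the light quarks to zero, as well as all names, are simplifying assumptions made for this illustration.

```python
import math
from fractions import Fraction

N_COLOURS = 3

# (flavour, electric charge, approximate pair threshold in GeV)
QUARKS = [
    ("u", Fraction(2, 3), 0.0),     # light-quark thresholds set to zero (assumption)
    ("d", Fraction(-1, 3), 0.0),
    ("s", Fraction(-1, 3), 0.0),
    ("c", Fraction(2, 3), 3.7),
    ("b", Fraction(-1, 3), 10.0),
    ("t", Fraction(2, 3), 350.0),
]


def charge_factor(sqrt_s):
    """Colour and charge factor 3 * sum Q_i^2 over all open flavours."""
    return N_COLOURS * sum(
        charge ** 2 for _, charge, threshold in QUARKS if sqrt_s > threshold
    )


def corrected_factor(sqrt_s, alpha_s):
    """The same factor including the first-order QCD correction (1 + alpha_s/pi)."""
    return float(charge_factor(sqrt_s)) * (1.0 + alpha_s / math.pi)


if __name__ == "__main__":
    for energy in (3.0, 5.0, 20.0, 400.0):
        print(f"sqrt(s) = {energy:6.1f} GeV  factor = {charge_factor(energy)}")
```

Multiplying this factor by $4\pi\alpha^2/3s$ gives the hadronic cross section of the box; dividing the hadronic cross section by the muon pair cross section leaves just the factor itself, which is why this ratio is a direct count of colours and open flavours.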
Yet non-vanishing masses for matter and force particles are incompatible
with gauge invariance, and consequently lead to divergences in the amplitudes
and cross sections. One can see that e.g. by the Feynman graph of Figure 9.15.
In this so-called box graph, two virtual force particles are exchanged between
a pair of fermions. Momentum conservation does not limit the individual
momentum transfers; any $q$ exchanged in the first can be compensated by
$-q$ in the second. One must thus integrate the amplitude over the momentum
transfer up to infinity. If the propagators of the virtual particles diminish like
$1/q^2$, like for photons or gluons, there is no problem, the integral converges.
But if the propagator has a pole at $q^2 = M^2 \neq 0$, i.e. if the force particle
is massive, the integral diverges and cannot be calculated without artificially
limiting the momentum transfer. Such a cut-off can be introduced, with the
hope that it corresponds to a new scale in the process. But new cut-offs would
be necessary at every higher order, so the theory makes no prediction beyond
lowest order. Similar divergences occur when masses are taken into account
for the virtual fermions in our example.


Figure 9.15 A so-called box graph, a higher order contribution to the scattering
between two fermions. A momentum transfer $q$ between $f$ and $f'$ is compensated
by an opposite one in the second boson exchange. The energy-momentum
circulating in the box is thus not limited by the kinematics of the incoming and
outgoing fermions.


There is thus a serious problem. Either no particle has an intrinsic mass, 
or the Standard Model is only valid as an effective theory. Matter particles 
and gauge bosons do have a mass, and some not even a small one. One way to 
solve this contradiction is to view mass not as an intrinsic property of particles, 
but as a dynamically generated one. A mechanism must be found by which 
massless particles are prevented from propagating at the speed of light. We 
then would have misinterpreted this mechanism hindering free motion as an 
intrinsic mass. The dynamical generation of mass is not at all an unfamiliar 
phenomenon as seen in Figure 9.16. While the masses of crystals, atoms and 
even atomic nuclei are dominated by their constituents, this is no longer the 
case for hadrons. The contribution of the up- and down-quark masses to the 
mass of protons and neutrons is small. Nucleon mass is dominated by the 
binding energy between quarks, provided by massless gluons. 

So the remaining question is: Can mass also be dynamically produced 
for point-like particles without an internal structure? Their mass spectrum 
is indeed impressively wide, as Figure 9.4 shows. It ranges from the neutrino 
mass, probably of the order of 10~%eV, to the top quark with $1.7 \times 10^{11}$ eV,
Figure 9.16 Contributions of constituent mass and binding energy to the
masses of crystals, atoms, nuclei and nucleons. From left to right, binding 
energy makes larger and larger contributions, for nucleons it dominates. 


spanning 17 orders of magnitude. While photons and gluons are massless, 
weak bosons also weigh of the order of $10^{11}$ eV.

The quest for dynamically generated particle masses found an answer
in solid state physics in the 1960s, long before massive gauge bosons were
discovered [355]. The British physicist Peter Higgs and others later found out how
similar dynamics can be applied to the gauge bosons of the Standard Model [370,
375, 377, 378]. If massive fields break gauge symmetry, the opposite should also
be true. A spontaneous breakdown of symmetry should be able to turn mass- 
less fields into massive ones. That breaking a symmetry can make massless 
systems massive can be made plausible with a simple mechanical Gedanken- 
experiment shown in Figure 9.17. Imagine an infinitely thin needle, balanced 
vertically on a flat surface. It has no moment of inertia around its axis. Now we 
stress the needle along its axis by putting a weight on top, large and beyond 
its elastic limit. The equilibrium is evidently unstable, a small fluctuation 
in the metal lattice will bend the needle spontaneously in an unpredictable 
direction. The system thus falls into its ground state of minimal potential 
energy. The axial symmetry of the system is broken, simultaneously it has 
developed a moment of inertia around the vertical axis. The loss of symmetry 
has generated a kind of mass. 

The mechanism of spontaneous symmetry breaking in particle physics, 
named after Peter Higgs, is also based on a scenario where a system in its 
ground state no longer exhibits a symmetry, which it does have in general. 
We discuss a simple example in Focus Box 9.18 to show how it works. If 
a ubiquitous scalar field, the Higgs field, results from such a spontaneous 
symmetry breaking, it can confer an apparent mass to all particles to which 
it couples, including itself. The quantum of this field is a spin zero particle, 
the Higgs boson. Its special feature is that it is present everywhere and all 
the time, as a property of the vacuum. It does not have to be produced, but 
interacts with particles anyway, if they have the required coupling. This way 

Figure 9.17 A mechanical example demonstrating that the breaking of symmetry
can lead to mass-like phenomena, in this case a moment of inertia $I$.
A symmetric unstable equilibrium with $I = 0$ is shown on the left, the asymmetric
ground state with $I \neq 0$ on the right.


the Higgs field keeps them from evolving at the speed of light; we call this 
artefact their mass. One can picture this like the motion through a viscous 
liquid, except that particles do not lose energy by bouncing off the Higgs 
boson. Massless particles can thus indeed move at less than the speed of light. 
The apparent mass they have is determined by their coupling strength to the 
Higgs field. Of course, a single Higgs field is just the minimal scenario. More 
than one may be present. 

With the exception of its mass, which after all it generates itself, all properties
of the Higgs boson had been known since the 1960s, but of course not whether it
really existed as a solution to the mass problem. Its profile is given by the role
it is supposed to fill: it is a scalar boson without charge, coupling preferentially 
to heavy matter particles and gauge bosons. It also decays preferentially into 
the heaviest particles kinematically accessible, but all final states compatible 
with conservation laws occur. Its wanted poster hung on the wall of several 
generations of accelerators and colliders. It was finally identified in 2012
by the experiments at the CERN Large Hadron Collider in a spectacular way, 
at a mass of about 125 GeV. An overview of the history of this search and 
discovery has been given e.g. by Karl Jakobs and Chris Seez [621]. Figure 9.18 
shows an example reaction recorded by the ATLAS experiment in 2016. The 
Higgs boson decays here into two Z bosons, one of which in turn decays into 
a pair of muons, the other one into an electron-positron pair. Since its discovery,
many of the properties of the particle have been measured and found to
be in agreement with the ones a Higgs boson should have¹⁵. Nonetheless it
remains uncertain if it is the only such particle; the search will continue. What 
is already clear is that the vacuum is not an empty theatre, where particles 
evolve and interact with each other. It is a rather bustling environment, which 
actively influences what happens. 


15See e.g. Heather Gray and Bruno Mansoulié, The Higgs boson: the hunt, the discovery,
the study and some future perspectives, 
https://atlas.cern/updates/atlas-feature/higgs-boson 
We construct a toy model with a photon-like gauge boson to demonstrate the
functioning of the Higgs mechanism. We start with a Lagrange density for a
complex scalar field $\Phi$ interacting with a boson field $A_\mu$:

$$\mathcal{L} = (\partial^\mu + ieA^\mu)\Phi^* \, (\partial_\mu - ieA_\mu)\Phi - \mu^2 \Phi^*\Phi - \lambda(\Phi^*\Phi)^2 - \frac{1}{4}F_{\mu\nu}F^{\mu\nu}$$


It is invariant under U(1) gauge transformations, since we already included
the required term $j_\mu A^\mu$ with the scalar current of Focus Box 7.2. For
$\mu^2 > 0$ this is the Lagrange density of QED from Focus Box 9.9, for a
scalar field with mass $\mu$, extended by a self-interaction term between four
$\Phi$ fields with coupling $\lambda$. The potential energy density,
$V = \mu^2|\Phi|^2 + \lambda|\Phi|^4$, has a minimum for $\Phi = 0$, easy to
verify by calculating its first and second derivative with respect to $\Phi$.
This is normal: when there is no field, the energy density should be minimum.
The interpretation of the first term in the potential changes drastically when
we choose $\mu^2 < 0$. The term proportional to $|\Phi|^2$ is no longer a mass
term since $\mu$ is imaginary; $\Phi$ is a massless field. Moreover, the
potential has a local maximum at $\Phi = 0$.


A minimum is found instead for a finite value of the field,
$|\Phi| = v/\sqrt{2}$ with $v = \sqrt{-\mu^2/\lambda}$. The potential as a
function of the real and imaginary part of $\Phi$ has the shape shown on the
right. It still is symmetric under U(1) phase transformations of the $\Phi$
field. However, if we want to perturbatively expand our toy theory, we must
choose one of the (infinitely many) minima of its potential energy density.
Only then is the system stable against small perturbations. Our perturbative
“vacuum” thus cannot be empty space-time, but contains a non-zero field
$\Phi(x)$ everywhere. Let us choose to expand our field around the specific
minimum $\Phi = (v/\sqrt{2})\,e^{i\theta(x)}$. Any $\theta$ is allowed by gauge
freedom, but we have to choose one at each $x$. We can then write small
perturbations as:


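$$\Phi(x) = \frac{1}{\sqrt{2}}\,[v + h(x)]\,e^{i\theta(x)}$$

The Lagrange density in terms of the real field $h$ is:

$$\mathcal{L} = \frac{1}{2}(\partial_\mu h)^2 - \lambda v^2 h^2 + \frac{1}{2}e^2 v^2 A_\mu A^\mu - \frac{1}{4}F_{\mu\nu}F^{\mu\nu} + \mathrm{Int}(h, A)$$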


The last term is shorthand for interactions and self-interactions of the
fields. The angle $\theta$ does not appear; $A_\mu$ absorbs it. But we now have
mass terms for $h$ with $m_h = \sqrt{2\lambda v^2}$ and the gauge field $A_\mu$
with $m_A = ev$, while there were no mass terms in the original Lagrange
density. The U(1) symmetry of the original Lagrangian has disappeared, mass
terms appear [361]. The field $h$ is called a Higgs field, its quantum is a
Higgs boson.


Focus Box 9.18: A toy example of the Higgs mechanism 
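
The relations of the toy model can be checked numerically. The following is a minimal sketch, not part of the original text: the parameter values for $\mu^2$, $\lambda$ and $e$ are arbitrary illustrative assumptions in natural units, and the function names are hypothetical. It computes $v$, $m_h$ and $m_A$, and verifies by finite differences that the curvature of the potential along $h$ at the chosen minimum equals $m_h^2$.

```python
import math


def toy_higgs(mu2, lam, e):
    """Return (v, m_h, m_A) of the toy model for mu2 < 0 and lam > 0."""
    if mu2 >= 0 or lam <= 0:
        raise ValueError("symmetry breaking needs mu2 < 0 and lam > 0")
    v = math.sqrt(-mu2 / lam)           # v = sqrt(-mu^2 / lambda)
    m_h = math.sqrt(2.0 * lam * v * v)  # mass of the field h
    m_a = e * v                         # mass of the gauge field A
    return v, m_h, m_a


def potential(phi_abs, mu2, lam):
    """Potential energy density V = mu^2 |Phi|^2 + lambda |Phi|^4."""
    return mu2 * phi_abs**2 + lam * phi_abs**4


def curvature_along_h(mu2, lam, v, step=1e-4):
    """Numerical d^2V/dh^2 at h = 0, with |Phi| = (v + h) / sqrt(2)."""
    def v_of_h(h):
        return potential((v + h) / math.sqrt(2.0), mu2, lam)
    return (v_of_h(step) - 2.0 * v_of_h(0.0) + v_of_h(-step)) / step**2


# Illustrative values only (assumptions, natural units)
MU2, LAM, E = -1.0, 0.5, 0.3
v, m_h, m_a = toy_higgs(MU2, LAM, E)
curvature = curvature_along_h(MU2, LAM, v)

print(f"v = {v:.4f}, m_h = {m_h:.4f}, m_A = {m_a:.4f}")
print(f"curvature of V along h = {curvature:.4f}, m_h^2 = {m_h**2:.4f}")
```

Since the mass term in the Lagrange density is $\frac{1}{2}m_h^2 h^2$, the second derivative of $V$ with respect to $h$ at the minimum should reproduce $m_h^2 = 2\lambda v^2 = -2\mu^2$.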
Figure 9.18 A display of a candidate Higgs boson event from proton-proton
collisions recorded by the ATLAS experiment at the LHC at a collision energy
of 13 TeV. The candidate event is reconstructed in the $\mu^+\mu^-e^+e^-$ final state.
The long grey lines show the path of the two muons including the hits in 
the muon spectrometer, the short grey lines show the paths of the two elec- 
trons together with the energy deposit in the electromagnetic calorimeter. 
The lighter tracks in the inner detector are the remaining charged particles 
from the Higgs boson candidate vertex. The other tracks come from addi- 
tional proton-proton reactions recorded simultaneously because of the high 
luminosity of the collider. (Credit: ATLAS © CERN) 
The scenario of the Standard Model is thus completed. The systematics of
matter, the motion of particles and also the forces which act on them are char- 
acterised by symmetries. Kinematics, equations of motion, follow from Lorentz 
invariance, dynamics from gauge invariance. The charges of electroweak and 
strong interactions are thus conserved, as the Noether theorem requires, just 
like energy and momentum. Obviously all these statements must constantly 
be challenged by experiment, to the accuracy achievable at any moment. On 
the other hand, quantum numbers which are not protected by a symmetry
are at most approximately conserved. Examples are the flavour quantum
numbers and particle properties under discrete symmetries, like the one
between matter and antimatter. 

Thus symmetries are the key to the microcosm. They allow the reduction of
an enormous variety of physical phenomena to a few basic laws, as sketched 
schematically in Figure 1. One is allowed to hope that this reduction may 
continue, but one doesn’t have to. If a further step in reduction were possible, 
it would at least have to include a few phenomena outside the Standard Model, 
which we know about today and which we will discuss briefly in the final 
chapter. 
CHAPTER 10


Pushing the boundaries 


In moving from a theory to the theory that supersedes it, we do 
not save only the verified empirical content of the old theory, but more. 
This “more” is a central concern for good physics. It is the source, I 
think, of the spectacular and undeniable predictive power of theoretical 
physics. I think that by playing it down one risks misleading theoretical 
research into a less effective methodology. 

... figuring out where the true insights are and finding a way of 
making them work together is the work of fundamental physics. This 
work is grounded on confidence in the old theories, not on a random 
search for new ones. 


Carlo Rovelli, Quantum Gravity, 2004 [551] 


IN THIS CHAPTER I describe known deficiencies of the Standard Model,
including dark matter, dark energy and quantum gravity. I sketch some
current directions of research trying to include them in a way compatible
with the introductory quote, but without being sure that I include the right
directions. Much of the physics later in this chapter is really beyond me. I am
walking on rather thin ice here and what I write may be out of date tomorrow. 
So please take care when you decide to follow me onward. 


10.1 DARK MATTER 


With the discovery of the Higgs boson, the Standard Model as described in 
Chapter 9 is internally consistent. It might well be that there is no
fundamentally new physics concerning particles and their interactions [581]
until the Planck mass is reached, about $10^{19}$ GeV, or a million billion
times the LHC energy. There might well be no supersymmetry, no multiple Higgs
bosons, no structure to quarks and leptons. But even in this case, there is
compelling evidence that the matter and energy content of the Standard Model
is incomplete.
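
The quoted scale can be reproduced from fundamental constants. The sketch below is an illustration rather than part of the original argument: it assumes the usual definition of the Planck energy, $M_P c^2 = \sqrt{\hbar c^5/G}$, uses rounded constant values, and takes 13 TeV as the LHC energy.

```python
import math

HBAR = 1.054571817e-34            # reduced Planck constant, J s
C = 2.99792458e8                  # speed of light, m/s
G = 6.67430e-11                   # gravitational constant, m^3 kg^-1 s^-2
JOULE_PER_GEV = 1.602176634e-10   # energy of 1 GeV in joules


def planck_energy_gev():
    """Planck energy sqrt(hbar c^5 / G), converted to GeV."""
    return math.sqrt(HBAR * C**5 / G) / JOULE_PER_GEV


LHC_ENERGY_GEV = 1.3e4            # 13 TeV collision energy (assumed reference)
planck = planck_energy_gev()
ratio = planck / LHC_ENERGY_GEV

print(f"Planck energy: {planck:.3e} GeV")
print(f"Planck energy / LHC energy: {ratio:.1e}")
```

The ratio comes out close to $10^{15}$, i.e. the "million billion" of the text.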


Reproduced with permission of the Licensor through PLSclear. 
The best-known example is the astrophysical evidence for the existence of
dark matter, a form of matter that emits no light and neither reflects nor
absorbs it. Speculation about elusive forms of matter is as old as philosophical
thinking [649]. The first to use gravitational observation to find an object 
invisible to astronomical telescopes was the mathematician Friedrich Bessel. In 
the 1840s he used accurate measurements of the stellar parallax² to conclude
that Sirius had a dark companion, today known as Sirius B. His work also 
contributed to the discovery of Neptune a few years later. Thus even if they 
only manifest themselves through gravity, objects do not go undetected. 

The American astronomer of Swiss origin Fritz Zwicky is often cited as the 
discoverer of dark matter. This is arguably also because he had a very strong 
personality³; Freeman Dyson characterises him as “intensely Swiss” [690]. In
1933 he studied the velocities of galaxies in the Coma cluster [256, 279]. He 
applied the virial theorem to link the mass of the cluster to the width of 
the velocity distribution measured by Edwin P. Hubble [240]. The virial the- 
orem applied to astrophysics and Zwicky’s argument are explained in Focus 
Box 10.1. From estimates of the number of galaxies in the cluster and their 
average mass he arrived at a total mass corresponding to an expected velocity
dispersion of about 80 km/s. The observed velocity spread is much larger, 1000
km/s along the line of sight, estimated via Doppler shift measurements [240].
The visible mass is thus much too small to explain the spread of velocities.
Zwicky concluded [256]: “If this should prove true, one would get the surpris- 
ing result that dark matter is present in much larger density than luminous 
matter.” In 1936, Sinclair Smith published a similar estimate [272] for the 
Virgo cluster. He also found a much larger mass than the value estimated 
from luminosity measurements by Hubble [261]. 

One of the most convincing pieces of evidence for the existence of dark matter
comes from the rotation velocities of objects around their galaxies, thus one
step down in scale from the cluster argument of Zwicky and Smith. They are also measured by
the electromagnetic Doppler effect, which shifts the line spectra of abundant 
elements in young stars, like hydrogen and nitrogen, towards the red when 
the source recedes, towards the blue when it approaches. Spectroscopy for 
astronomy [601] works according to the principles of diffraction, as explained 
in Focus Box 5.10. Focus Box 10.2 shows how this is used to measure the radial 
mass distribution in spiral galaxies. Precision measurements of the velocity 
as a function of distance from the centre of a galaxy were pioneered in the 
spectroscopic survey of Andromeda by Vera Rubin and Kent Ford in the 
early 1970s [405]. In January 2020, it was announced that the Large Synoptic 
Survey Telescope (LSST), which is currently under construction in Chile, will 
be named the Vera C. Rubin Observatory. 


2Stellar parallax is the apparent shift of the image of a near star with respect to a distant
constellation as the Earth moves around the Sun. It can be used to triangulate the distance 
to the star. 

3For a lively account of the debate around Zwicky’s way of dealing with colleagues, 
and his daughter’s defence of her father, see the article by Richard Panek in Discover 
Magazine [575]. 
The virial theorem states a relation between the mean values of kinetic and
potential energies for a system in equilibrium [363]. The mean over time of
the kinetic energy $T$ for a system of $N$ particles is:

$$2\langle T\rangle = -\sum_{i=1}^{N} \langle \vec{F}_i \cdot \vec{r}_i \rangle = \sum_{i=1}^{N} \langle \vec{r}_i \cdot \vec{\nabla}_i U \rangle$$

where $\vec{F}_i$ is the force on particle $i$ at position $\vec{r}_i$. The
second equation is valid for conservative forces with the potential $U$. If
this potential is radially symmetric and follows a power law $U \propto r^n$,
like the gravitational or Coulomb potentials ($n = -1$), we get the
simplified form:

$$2\langle T\rangle = n\,\langle U\rangle , \quad \text{i.e.} \quad 2\langle T\rangle = -\langle U\rangle \ \ \text{for} \ n = -1$$

On the other hand, the kinetic energy of a single particle of mass $m$ and
velocity $\vec{v}$ is $T = \frac{1}{2}m\vec{v}^2$. If we assume an equal
partition of the velocities over the three spatial directions, then
$\langle v_x^2 + v_y^2 + v_z^2 \rangle = 3\langle v^2 \rangle$, where $v$ is
the velocity component along any line of sight. For the total kinetic energy
we get $2\langle T\rangle = 3M\langle v^2\rangle$ with the total mass
$M = \sum m$. The total potential energy is $U = -\alpha GM^2/R$, where $R$ is
the total radius and $G$ is the gravitational constant. The constant $\alpha$
depends on the radial mass distribution. For an unrealistic radial
equipartition one would e.g. get a coefficient of 3/5, more realistic
coefficients are also of order one. Using the standard deviation $\sigma$,
with $\sigma^2 = \langle v^2\rangle - \langle v\rangle^2$, as a measure for
the width of the velocity distribution and $\langle v\rangle = 0$ for an
equilibrium system, we get the mass estimator which Fritz Zwicky used for the
Coma cluster:


$$\frac{\alpha GM}{R} = 3\sigma^2 \ ; \quad M = \frac{3R\sigma^2}{\alpha G}$$


A wider velocity distribution of the galaxies in the cluster thus leads to a
higher virial mass estimator. Zwicky estimated the total mass of the Coma
cluster as 800 galaxies of a billion solar masses each, thus
$M \approx 1.6 \times 10^{42}$ kg. Its radius is about a million light years,
i.e. $R \approx 10^{22}$ m. The width of the velocity distribution predicted
by the virial theorem is about $\sigma \approx 100$ km/s, while the observed
one is of the order of 1000 km/s. There is thus much more invisible mass
present in the cluster than visible mass.


Focus Box 10.1: Virial theorem and Zwicky’s dark matter estimate 
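
The estimate of Focus Box 10.1 can be retraced numerically. The sketch below uses the rounded inputs quoted in the box (800 galaxies of a billion solar masses, $R \approx 10^{22}$ m, an observed spread of 1000 km/s); the coefficient $\alpha = 3/5$ and the rounded solar mass are assumptions made here for illustration, and the function names are hypothetical. With these inputs the predicted dispersion comes out at a few tens of km/s, the same order as the 80 to 100 km/s quoted; the exact value depends on $\alpha$ and on the rounding.

```python
import math

G = 6.674e-11      # gravitational constant, m^3 kg^-1 s^-2
M_SUN = 2.0e30     # solar mass in kg (rounded)


def virial_sigma(mass_kg, radius_m, alpha=1.0):
    """Line-of-sight velocity dispersion from alpha*G*M/R = 3*sigma^2."""
    return math.sqrt(alpha * G * mass_kg / (3.0 * radius_m))


def virial_mass(sigma_m_s, radius_m, alpha=1.0):
    """Virial mass estimator M = 3*R*sigma^2 / (alpha*G)."""
    return 3.0 * radius_m * sigma_m_s**2 / (alpha * G)


ALPHA = 0.6                         # assumed coefficient (3/5)
luminous_mass = 800 * 1e9 * M_SUN   # 800 galaxies of 1e9 solar masses each
radius = 1e22                       # about a million light years, in metres
observed_sigma = 1.0e6              # 1000 km/s, in m/s

sigma_expected = virial_sigma(luminous_mass, radius, ALPHA)
mass_needed = virial_mass(observed_sigma, radius, ALPHA)
ratio = mass_needed / luminous_mass

print(f"luminous mass: {luminous_mass:.2e} kg")
print(f"expected dispersion: {sigma_expected / 1e3:.0f} km/s")
print(f"virial mass / luminous mass: {ratio:.0f}")
```

Because the virial mass scales with $\sigma^2$, a tenfold discrepancy in velocity spread translates into a discrepancy of order one hundred or more in mass.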
The Doppler effect causes the wavelength of light from a moving source to be
shifted. When the source moves relative to the observer with velocity $v$
along the line of sight, its wavelength $\lambda$ is shifted from its value at
rest by $\Delta\lambda$:

$$z \equiv \frac{\Delta\lambda}{\lambda} = \sqrt{\frac{1+v/c}{1-v/c}} - 1$$

When the source is receding, $z > 0$, corresponding to a red shift for visible
light. When it is approaching, $z < 0$ and visible light is blue shifted.
Frequency standards like the bright ($n = 3 \to 2$) line of hydrogen
(H-$\alpha$, $\lambda = 656$ nm) or the nitrogen line (NII, $\lambda = 658$ nm)
are typically emitted by hot gas around young stars. Their wavelength shift is
often used to measure source velocities.


An example of how this is done is shown on the right. The photo shows an image
of the spiral galaxy NGC 3198 seen practically edge-on. Luminous matter right
of the galactic centre is receding, approaching on the left. The velocity
component along the line of sight is measured. Using the orientation of the
rotation axis, indicated by the line, it is converted to rotational
velocities.
Credit: Sloan Digital Sky Survey
The result is shown on the left [456], the rotational velocity
$v_\mathrm{cir}$ as a function of distance to the galactic centre. Matter is
moving with a velocity almost independent of distance from the galactic
centre, even at points far outside the luminous region. This is not what one
would expect if only luminous matter were present.
Credit: T.S. van Albada et al. [456]


Stars and other bright objects in a spiral galaxy are moving around its axis
with a roughly circular motion. The velocity as a function of distance to the
axis, $R$, is given by Kepler’s third law: $v_\mathrm{cir}^2(R) = GM(R)/R$.
Here $M(R)$ is the total mass enclosed by the orbit and $G$ the gravitational
constant. Up to the galaxy’s outer limit, $M(R)$ grows with radius and so does
the rotational velocity. Outside the visible limit of a few kpc, one should
have $M$ = const, and a decrease of the rotational velocity,
$v_\mathrm{cir} \propto 1/\sqrt{R}$. An example of a modern measurement of the
rotational curve is shown above [456]. One observes
$v_\mathrm{cir}(R) \approx$ const at large $R$. This means that there is an
extended halo of invisible mass reaching far beyond the optical limit. The
calculated velocity distribution caused by the galactic disk is shown by the
curve, together with the deduced contribution by the halo of dark matter.
Focus Box 10.2: Doppler effect and rotation curves of spiral galaxies
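
The two steps of Focus Box 10.2, from wavelength shift to velocity and from velocity to enclosed mass, can be sketched as follows. The numbers are illustrative assumptions and not measurements: a flat rotation velocity of 150 km/s and radii of 10 and 30 kpc are chosen only to show the scaling, and the function names are hypothetical.

```python
import math

C = 2.99792458e8   # speed of light, m/s
G = 6.674e-11      # gravitational constant, m^3 kg^-1 s^-2
KPC = 3.0857e19    # one kiloparsec in metres
M_SUN = 1.989e30   # solar mass, kg


def redshift(v_los):
    """z = sqrt((1 + v/c) / (1 - v/c)) - 1; v_los > 0 for a receding source."""
    beta = v_los / C
    return math.sqrt((1.0 + beta) / (1.0 - beta)) - 1.0


def velocity_from_redshift(z):
    """Invert the Doppler formula to get the line-of-sight velocity."""
    ratio = (1.0 + z) ** 2
    return C * (ratio - 1.0) / (ratio + 1.0)


def enclosed_mass(v_cir, radius):
    """M(R) = v_cir^2 * R / G for a circular orbit."""
    return v_cir**2 * radius / G


def keplerian_velocity(mass, radius):
    """Circular velocity if all the mass lies inside the orbit."""
    return math.sqrt(G * mass / radius)


V_FLAT = 150e3          # assumed flat rotation velocity, m/s
H_ALPHA_NM = 656.0      # H-alpha rest wavelength, nm

shift_nm = H_ALPHA_NM * redshift(V_FLAT)
mass_10 = enclosed_mass(V_FLAT, 10 * KPC)
mass_30 = enclosed_mass(V_FLAT, 30 * KPC)
v_kepler_30 = keplerian_velocity(mass_10, 30 * KPC)

print(f"H-alpha shift at 150 km/s: {shift_nm:.3f} nm")
print(f"M(10 kpc) = {mass_10 / M_SUN:.2e} solar masses")
print(f"M(30 kpc) = {mass_30 / M_SUN:.2e} solar masses")
print(f"Keplerian velocity at 30 kpc if M stopped growing: {v_kepler_30 / 1e3:.0f} km/s")
```

A flat curve thus implies an enclosed mass growing linearly with radius, whereas a constant mass would make the velocity fall by a factor $\sqrt{3}$ between 10 and 30 kpc.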
Gravitational lensing, suspected to exist by Newton⁴ and by Einstein prior
to the formulation of general relativity [496], is a formidable tool to measure 
the mass of large astronomical structures, even when this mass does not emit 
light. The principle is based on the fact that light rays follow straight lines 
in space-time distorted by the gravity of objects. In this manner the gravity 
of a heavy object in the foreground causes multiple deformed images of an 
object in the background. Measuring the deformation of the image of a galaxy 
behind a cluster, for example, one can calculate the mass of the cluster in the 
foreground. This technique is used in the compound image of Figure 10.1 
showing the cluster of galaxies 1E 0657-56, better known as the bullet cluster. 
What is shown in white are bright objects from optical observation, thus 
normal matter. Overlaid in light grey is an image of hot gas taken with
X-rays, showing normal matter again. A gravitational lensing image is overlaid
in dark grey, showing the distribution of all matter.
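
To give a feeling for the size of the lensing effect, the sketch below evaluates the general-relativistic deflection angle of a light ray passing a point mass $M$ at impact parameter $b$, $\hat\alpha = 4GM/(c^2 b)$. This formula is not quoted in the text above and is used here as an assumption of the simplest possible lens; the cluster mass and impact parameter are hypothetical round numbers.

```python
import math

G = 6.674e-11      # gravitational constant, m^3 kg^-1 s^-2
C = 2.99792458e8   # speed of light, m/s
M_SUN = 1.989e30   # solar mass, kg
R_SUN = 6.957e8    # solar radius, m
MPC = 3.0857e22    # one megaparsec in metres
ARCSEC_PER_RAD = 180.0 * 3600.0 / math.pi


def deflection_angle_rad(mass_kg, impact_parameter_m):
    """Point-mass light deflection angle 4GM / (c^2 b), in radians."""
    return 4.0 * G * mass_kg / (C**2 * impact_parameter_m)


def lens_mass_kg(angle_rad, impact_parameter_m):
    """Invert the relation: mass needed to produce a given deflection."""
    return angle_rad * C**2 * impact_parameter_m / (4.0 * G)


sun_arcsec = deflection_angle_rad(M_SUN, R_SUN) * ARCSEC_PER_RAD

# Hypothetical cluster: 1e15 solar masses, light passing at 1 Mpc
cluster_arcsec = deflection_angle_rad(1e15 * M_SUN, 1.0 * MPC) * ARCSEC_PER_RAD

print(f"deflection at the solar limb: {sun_arcsec:.2f} arcsec")
print(f"deflection by the hypothetical cluster: {cluster_arcsec:.0f} arcsec")
```

Measuring the deflection and knowing the geometry, one can invert the relation to obtain the lens mass, which is the principle behind the lensing map of the cluster.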

The image shows the situation after two clusters of galaxies collided. The 
light grey part corresponds to the luminous hot gas, and shows the defor- 
mation, deceleration by friction, and the coalescence which is expected after 
such a collision for ordinary matter. The collision leaves a trail of heated gas 
behind like a bullet passing through matter. The dark grey cloud shows that 
the bulk of the matter in both clusters is not luminous and extends farther 
than the luminous one. The smaller cluster now on the right has traversed the 
larger on the left with little disturbance for its dark component. Its bulk is in 
advance with respect to the much less abundant normal matter. 

One concludes that dark matter accounts for about 85% of the mass of 
galaxies and their clusters, but this percentage can vary a lot. A recently
discovered galaxy, named Dragonfly 44, is even suspected to consist almost
entirely of dark matter [631].

Since it can be observed by different techniques, it is clear that dark
matter exists. It contributes to the gravitational confinement of galaxies and
their clusters, and is probably also involved in the formation of large cosmic
structures. Its gravitational interaction is the same as for normal matter; we
are thus entitled to call it matter, even though it is unconventional in that
it does not appear to interact otherwise. If it consists of particles, and there is
no reason to doubt that it does, they must have the following properties: 


• They must be electrically neutral, otherwise they would emit light.

• They are moving at non-relativistic speeds, thus they are probably
heavy. Dark matter of this kind is called cold dark matter.

• They are probably fermions, otherwise dark matter would collapse.

• They interact very weakly with each other and with normal matter,
otherwise reaction products would be abundant.


4“Query 1. Do not Bodies act upon Light at a distance, and by their action bend its 
Rays; and is not this action (caeteris paribus) strongest at the least distance?” [2, 3rd book, 
part I, p. 313] 
Figure 10.1 A combined image, covering 7.5 × 5.4 arcmin, of the galaxy cluster
1E 0657-56, also known as the bullet cluster. The optical image from Magellan 
and HST shows galaxies as bright white spots. Hot gas in the cluster, which 
contains the bulk of the normal matter, is shown by the Chandra X-ray Obser- 
vatory image in light grey. Most of the mass in the cluster is shown in dark 
grey, as measured by gravitational lensing, the distortion of background images 
by mass in the cluster. (Credit: X-ray: NASA/CXC/CfA/M.Markevitch et 
al.; Optical: NASA/STScI; Magellan/U.Arizona/D.Clowe et al.; Lensing Map: 
NASA/STScI; ESO WFI; Magellan/U.Arizona/D.Clowe et al.) 


The search for hypothetical dark matter constituents, which we generically
denote with $\chi$, is thus well targeted. They are attacked from three sides
simultaneously, as schematically depicted in Figure 10.2:


• One can try to produce dark matter particles pairwise at high energy
colliders like the LHC. Since they do not interact much with ordinary matter,
their presence would be signalled by missing energy and momentum
in the final state [654]. So far, the energy of the colliders and/or the
sensitivity of experiments has been insufficient for an observation.

• One also searches for the rare interactions of dark matter particles with
ordinary matter, in a scattering process. This is sometimes called direct
detection. It requires large mass cryogenic detectors sensitive to the tiny
recoil that a $\chi$-nucleus scattering would cause [685]. Despite the ever
increasing sensitivity of experiments, there has been no detection yet.

• If dark matter particles are their own antiparticles, they can annihilate
into pairs of ordinary particles. These can then be observed as an
extraordinary contribution to cosmic rays, neutrinos or photons [606].
Since dark matter is everywhere, observatories covering a large
proportion of the sky are the most promising options. The Fermi satellite
observatory and ground based observatories for high energy gamma rays
like H.E.S.S., VERITAS and MAGIC are examples for photons. The AMS
observatory on the International Space Station is an example for charged
cosmic rays (see Section 10.2). The neutrino observatory IceCube at the
South Pole may detect neutrinos from dark matter annihilation [640].
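
How tiny the recoil in direct detection is can be estimated from non-relativistic elastic kinematics, where the maximum recoil energy of a nucleus of mass $m_N$ hit by a particle of mass $m_\chi$ and speed $v$ is $E_\mathrm{max} = 2\mu^2 v^2/m_N$, with the reduced mass $\mu$. The sketch below is only an illustration: the dark matter mass of 100 GeV, the xenon target and the speed of 220 km/s are assumed values, not results from the text.

```python
C_KM_S = 299792.458   # speed of light, km/s
AMU_GEV = 0.931494    # atomic mass unit, GeV


def reduced_mass(m1, m2):
    """Reduced mass of a two-body system (same units as the inputs)."""
    return m1 * m2 / (m1 + m2)


def max_recoil_energy_kev(m_chi_gev, m_nucleus_gev, speed_km_s):
    """Maximum nuclear recoil energy 2 mu^2 v^2 / m_N for elastic scattering, in keV."""
    beta = speed_km_s / C_KM_S
    mu = reduced_mass(m_chi_gev, m_nucleus_gev)
    return 2.0 * mu**2 * beta**2 / m_nucleus_gev * 1e6   # GeV -> keV


M_CHI = 100.0               # hypothetical dark matter particle mass, GeV
M_XENON = 131 * AMU_GEV     # xenon nucleus, mass number 131 assumed
SPEED = 220.0               # assumed typical galactic speed, km/s

recoil_kev = max_recoil_energy_kev(M_CHI, M_XENON, SPEED)
print(f"maximum recoil energy: {recoil_kev:.1f} keV")
```

Recoil energies of a few tens of keV at most explain why large, very quiet detectors with low thresholds are required.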


Dark matter particles are thus fenced in from all sides and it seems that their 
discovery is only a matter of time and energy. One cannot ignore, however, 
that they may escape detection forever. A popular model for undetectable
dark matter involves the so-called sterile neutrinos [553, 555], right-handed
companions of ordinary neutrinos with a modest mass of order 10 keV. Because of
their wrong spin orientation, their interactions with matter are heavily sup- 
pressed, such that they may hide forever. But then again the existence of 
sterile neutrinos may be detectable by other means [576]. 

Dark matter may in fact not be the only explanation for the observed 
phenomena. As has been shown by Mordehai Milgrom [448, 449, 450], one 
can also modify Newton’s law of universal gravitation such that it stays valid
at and below solar system dimensions, but is modified at larger scales. Such 
efforts have also been made to explain away dark energy, which we will discuss 
in Section 10.3. 
10.2 COSMIC RAY REVIVAL


We have seen in the previous section that astrophysical phenomena play an 
increasingly important role in particle physics. They are no longer used as a 
source of energetic particles, but serve to investigate energetic processes in the 
cosmos, or the particle content of the Universe. Energies in the cosmos exceed 
man-made energies by many orders of magnitude. The spectrum of cosmic 
rays extends all the way to $10^{20}$ eV; there must thus be cosmic accelerators
which can reach such fantastic energies. The necessary condition is that the
size and magnetic field of the cosmic structure must be large enough to contain
the cosmic rays. This was first formulated by Michael Hillas [454]. The
product of the magnetic field and the radius of the source must be larger than 
the ratio of momentum p to charge Ze of the accelerated particle. The latter, 
often called magnetic rigidity R = pc/(Ze), is thus the relevant kinematic 
quantity for charged cosmic rays. 
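
The containment condition can be turned into numbers. For an ultra-relativistic particle the condition $BR > p/(Ze)$ corresponds to a maximum energy $E_\mathrm{max} \approx Z e B R c$, which in electronvolts is simply $Z \cdot B[\mathrm{T}] \cdot R[\mathrm{m}] \cdot c[\mathrm{m/s}]$. The sketch below is illustrative: the LHC dipole field and bending radius serve as a man-made sanity check, and the microgauss field is an assumed example value.

```python
C = 2.99792458e8   # speed of light, m/s
KPC = 3.0857e19    # one kiloparsec in metres


def hillas_max_energy_ev(charge_number, b_tesla, radius_m):
    """Maximum energy in eV of a particle contained by field B within size R."""
    return charge_number * b_tesla * radius_m * C


def required_size_m(energy_ev, charge_number, b_tesla):
    """Minimum source size needed to contain a particle of the given energy."""
    return energy_ev / (charge_number * b_tesla * C)


# Sanity check with a man-made accelerator: LHC dipoles, about 8.33 T
# over a bending radius of about 2804 m (values assumed here)
lhc_energy_ev = hillas_max_energy_ev(1, 8.33, 2804.0)

# Assumed cosmic example: protons of 1e20 eV in a 1 microgauss (1e-10 T) field
size_m = required_size_m(1e20, 1, 1e-10)

print(f"LHC-like containment energy: {lhc_energy_ev:.2e} eV")
print(f"size needed for 1e20 eV protons at 1 microgauss: {size_m / KPC:.0f} kpc")
```

The same field can contain iron nuclei ($Z = 26$) of 26 times higher energy, which is why the lines for protons and iron in a Hillas diagram differ.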

These considerations lead to the so-called Hillas diagram shown in Fig- 
ure 10.3 classifying potential sources for very high energy cosmic rays. In 
addition, the source class must be able to provide the necessary power and 
it must be sufficiently abundant to sustain the observed flux of particles. In 
general, the flux of cosmic ray particles, i.e. the number passing near Earth 
per square meter and second, decreases as a power law of rigidity, with a 
slope of roughly three orders of magnitude per decade. This means that the 
flux drops from about 1 particle per m$^2$ per second at a few times 10 GeV
to one particle per km$^2$ per century at EeV energies, $10^{18}$ eV. Thus the size of
the detector determines the observable energy range, not because very high 
energies cannot be measured, but because they occur so rarely. Above about 
$5 \times 10^{19}$ eV, the range of cosmic rays travelling through interstellar matter
is severely limited by the large resonant cross section of protons interacting
with photons of the cosmic microwave background ($p\gamma \to \Delta^+$, see Sections 9.1
and 10.3). Due to this process, the flux cuts off rather sharply at such ener- 
gies [617, 642]. The cut-off is called the Greisen-Zatsepin-Kuzmin limit [390, 
393]; it is indeed observed [666, 687]. 
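
The order of magnitude of this limit follows from kinematics. For a head-on collision of an ultra-relativistic proton with a photon of energy $E_\gamma$, the $\Delta^+$ resonance is reached when $m_p^2 + 4E_pE_\gamma \geq m_\Delta^2$ (in units with $c = 1$). The sketch below is a rough estimate under stated assumptions: it uses a single photon energy equal to the mean of a 2.725 K black body (about $2.7\,kT$), whereas the observed cut-off also reflects the high-energy tail of the photon spectrum and the width of the resonance.

```python
M_PROTON_GEV = 0.938272           # proton mass, GeV
M_DELTA_GEV = 1.232               # Delta(1232) mass, GeV
K_BOLTZMANN_EV_PER_K = 8.617333e-5
T_CMB_K = 2.725                   # CMB temperature, K


def gzk_threshold_ev(photon_energy_ev):
    """Proton energy (eV) to reach the Delta resonance in a head-on collision."""
    photon_gev = photon_energy_ev * 1e-9
    e_gev = (M_DELTA_GEV**2 - M_PROTON_GEV**2) / (4.0 * photon_gev)
    return e_gev * 1e9


# Mean photon energy of a black-body spectrum is about 2.701 kT
mean_photon_ev = 2.701 * K_BOLTZMANN_EV_PER_K * T_CMB_K
threshold_ev = gzk_threshold_ev(mean_photon_ev)

print(f"mean CMB photon energy: {mean_photon_ev:.2e} eV")
print(f"threshold for the mean photon: {threshold_ev:.1e} eV")
print(f"threshold for a photon 5 times more energetic: {gzk_threshold_ev(5 * mean_photon_ev):.1e} eV")
```

For the mean photon the threshold lies at a few $10^{20}$ eV; photons from the upper tail of the spectrum lower it to the region of $5 \times 10^{19}$ eV quoted in the text.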

Observables for cosmic rays are their composition, charge and flux as a 
function of rigidity or energy, at or near Earth. The observed abundances 
are dominated by hydrogen nuclei, i.e. protons (87%), and helium (9%). A 
few percent are heavier nuclei and electrons. Antimatter, on the contrary, is 
surprisingly rare, at the level of per mil for positrons and $10^{-5}$ for
antiprotons. Heavier antinuclei have not been detected with the required certainty to
claim an observation. The chemical composition of cosmic ray nuclei [653] is 
similar to the composition of matter in the solar system [545], with notable 
exceptions. Due to their larger binding energy, elements with even charge are 
more abundant in both than those with odd charge. However, the depletion 
of odd atomic numbers is less pronounced in cosmic rays, since heavier nuclei 
Figure 10.3 The so-called Hillas diagram [574] classifying potential
sources for high-energy cosmic rays according to their size $R$, plotted as
$\log(R/\mathrm{km})$, and magnetic field $B$. Sources which can accelerate
protons to $E > 10^{21}$ eV should lie above the upper line, sources above
the lower line can accelerate iron nuclei up to $10^{20}$ eV.
(Credit: M. Kachelriess [574])
can be split by spallation reactions⁵ and provide disfavoured species. This is
especially visible in the case of lithium, beryllium and boron [656], interme- 
diate products of stellar nucleosynthesis of the C-N-O group [644, 657]. They 
are therefore very rare in the solar system matter, much more abundant in 
cosmic rays due to spallation. All in all, the chemical composition supports 
that cosmic rays consist of stellar matter, ejected e.g. in supernova explosions. 
They come to us suffering important alterations of composition and spectra 
by interactions with interstellar matter as well as diffusion by magnetic fields 
at all scales, from solar to galactic and intergalactic. 

At modest energies, from MeV to TeV, spectrometers and calorimeters 
can be deployed in space to have direct access to cosmic ray data. Detec- 
tion of the rare antimatter contents, which is especially sensitive to potential 
contributions from dark matter, is of course restricted to magnetic spectrom- 
eters measuring the sign of the particle charge. Examples of modern space 
experiments are given in Focus Boxes 10.3 and 10.4. The results concerning 
ordinary matter and dominant contributions to the cosmic rays flux generally 
agree well with each other [618, 622, 648, 683]. Precision has vastly improved 
with respect to previous rather qualitative measurements in this energy range. 
The spectra at GeV to TeV energies contain important information about the 
sources, acceleration mechanisms and transport of cosmic rays confined in the 
Milky Way. For many nuclei, they reveal a new scale of the magnetic rigidity 
R = E/Z around a few 100 GV, where the slope of the spectrum changes. 
Spectra of nuclei are fed into numerical models of cosmic ray propagation like 
GALPROP [510] and interpreted this way in terms of cosmic accelerators and 
particle transport through the galaxy. 

In the energy range from $10^{15}$ to $10^{18}$ eV one suspects the transition from
galactic to extragalactic origin [579] of cosmic rays. Spectral features like the 
so-called knee and ankle in the all-particle spectra mark this energy range. 


5Spallation is the split of a heavy nucleus into lighter ones by inelastic scattering. Spal- 
lation products fly off with basically the same speed as the original nucleus. 
The flux of cosmic ray particles (the number per unit time, surface, solid
angle and energy) falls rapidly, roughly by a factor 1000 per decade in energy. 
The product of detector surface, angular acceptance and exposure time 
determines what energy range can be observed. 


The composition is dominated by light nuclei, such that
$e^+/p \approx \bar{p}/e^- \approx 10^{-4}$. To detect rare species, like
cosmic antimatter, excellent background rejection is key. The photo shows the
AMS detector installed on the International Space Station since May 2011.

The heart of AMS, as shown in the right graph with a high energy electron
traversing, is a magnetic spectrometer [...] magnet with field strength
0.15 T and [...]
The Time-of-Flight system (TOF) determines the direction of flight, measures
the velocity and the charge |Ze|. The Ring Imaging Cherenkov counter (RICH) 
also determines velocity and charge. The particle energy is finally measured 
in a calorimeter (ECAL) at the very bottom. AMS registered more than 100 
billion cosmic rays up to May 2017, the largest sample ever recorded. After 
a replacement of its cooling system in early 2020, it continues to take data, 
probably until the end of the ISS lifetime. 


Focus Box 10.3: The space borne cosmic ray spectrometer AMS 
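
Why rigidity measurement in such a spectrometer is demanding can be seen from the curvature of the track. A particle of rigidity $R = pc/(Ze)$ moves in a field $B$ on a circle of radius $r = R/(cB)$, and over a lever arm $L \ll r$ its track deviates from a straight line by the sagitta $s \approx L^2/(8r)$. In the sketch below only the field strength of 0.15 T comes from the box; the lever arm of 1 m and the rigidity of 100 GV are assumptions for illustration.

```python
C = 2.99792458e8   # speed of light, m/s


def bending_radius_m(rigidity_volt, b_tesla):
    """Radius of curvature r = R / (c B) for rigidity R in volts."""
    return rigidity_volt / (b_tesla * C)


def sagitta_m(rigidity_volt, b_tesla, lever_arm_m):
    """Sagitta s = L^2 / (8 r), valid for a lever arm much shorter than r."""
    return lever_arm_m**2 / (8.0 * bending_radius_m(rigidity_volt, b_tesla))


B_FIELD = 0.15      # tesla, field strength quoted in the box
LEVER_ARM = 1.0     # metres, assumed length of the measured track
RIGIDITY = 100e9    # volts (100 GV), assumed example

radius = bending_radius_m(RIGIDITY, B_FIELD)
sagitta = sagitta_m(RIGIDITY, B_FIELD, LEVER_ARM)

print(f"bending radius: {radius:.0f} m")
print(f"sagitta over {LEVER_ARM:.0f} m: {sagitta * 1e6:.0f} micrometres")
```

Since the sagitta is inversely proportional to rigidity, the position resolution of the tracking detectors sets the highest rigidity at which the sign of the charge can still be determined.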
If one gives up on charge determination, one can no longer distinguish
matter from antimatter, but a calorimeter suffices. This is the principle of
the CALET detector [688], accommodated on the Japanese Kibo module of the
ISS, as shown on the photograph. It measures the energy of $e^\pm$ and nuclei
in the MeV to TeV range and distinguishes the two by the shape of the shower.

Credit: JAXA

Installed in October 2015, CALET registered 630 million cosmic rays up to
May 2018. It will continue data taking for at least five more years.


Focus Box 10.4: Space borne cosmic rays calorimeter CALET 
Figure 10.4 The flux $\Phi(E)$ of cosmic ray positrons, multiplied by $E^3$ for better
visibility of the high energy part. The data from the AMS experiment (dots
with error bars) [681] are compared to a generic two-component model (thick 
curve and error band), describing conventional sources (labelled diffuse term) 
and an additional unconventional source (source term). (Credit: AMS Coll.) 


At these extreme energies the low fluxes require square kilometre detector
sizes for meaningful measurements. Terrestrial cosmic ray observatories thus
use the Earth’s atmosphere as a calorimetric detector. When high energy
primary particles enter the atmosphere, they interact with air molecules and
cause an extensive shower. If the primary particle is a photon or electron, an
electromagnetic shower will develop; its characteristics are the same as in a
calorimeter detector, but scaled up by the low density and light composition
of air. Cosmic protons and nuclei cause hadronic showers consisting of a large
number of hadrons. All of these shower particles excite air molecules, which
then emit fluorescent light in the UV range. It can be detected and located by
appropriate cameras. The different shapes of the showers make it possible to
distinguish electromagnetic from hadronic ones. When the shower hits the
Earth’s surface, it has a diameter of the order of a kilometre.
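
The scale of an electromagnetic air shower can be estimated with Heitler's toy model, which is not discussed in the text and is used here only as a simple illustration: after each splitting length $X_0 \ln 2$ every particle splits into two, until the energy per particle falls to the critical energy $E_c$. The radiation length and critical energy of air used below (about 37 g/cm² and 85 MeV) are assumed round values, as is the primary energy.

```python
import math

X0_AIR = 36.6          # radiation length of air, g/cm^2 (assumed value)
E_CRIT_AIR_EV = 8.5e7  # critical energy in air, eV (assumed value)


def heitler_shower(primary_energy_ev):
    """Return (generations, particles at maximum, depth of maximum in g/cm^2)."""
    log_ratio = math.log(primary_energy_ev / E_CRIT_AIR_EV)
    generations = log_ratio / math.log(2.0)       # number of doublings
    n_max = primary_energy_ev / E_CRIT_AIR_EV     # particles at shower maximum
    x_max = X0_AIR * log_ratio                    # generations * X0 * ln 2
    return generations, n_max, x_max


PRIMARY_EV = 1e15      # assumed primary photon energy
generations, n_max, x_max = heitler_shower(PRIMARY_EV)

print(f"generations: {generations:.1f}")
print(f"particles at maximum: {n_max:.1e}")
print(f"depth of shower maximum: {x_max:.0f} g/cm^2")
```

In this picture the number of particles at maximum grows linearly with the primary energy, while the depth of the maximum grows only logarithmically, so that even very energetic showers develop within the thickness of the atmosphere.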

One can observe and measure air showers by two methods: fluorescence 
light and Cherenkov light caused by the relativistic components of the shower; 
and shower particles arriving on the Earth surface. Both measurement meth- 
ods are briefly explained in Focus Box 10.5. While lower energy cosmic rays 
are scrambled by magnetic fields and lose memory of their original direction, 
very high rigidity particles ought to be stiff enough to roughly point back 
to their origin, or at least show some anisotropy. Prime suspects for sources 
are Active Galactic Nuclei, which have to be within 100 Mpc ≈ $3 \times 10^{24}$ m,
Credit: Pierre Auger Coll. [615], reprinted by permission from Elsevier


The largest air shower detector in operation is the Pierre Auger Cosmic
Ray Observatory in Argentina [615]. The left-hand graph above shows the
geographic area covered by the detector, about 30 times the size of Paris.
The straight lines show the lines of sight of 27 fluorescence detectors installed
in four places. The photo on the right shows one overseeing the site. The
dots indicate the locations of 1660 surface detector tanks filled with water.
One is seen in the lower right corner of the photograph. When relativistic
particles from the tail of the air shower pass through the water, they emit
Cherenkov light, which is detected by photomultipliers. The complementary
information from the fluorescence cameras and the surface array is used to
determine the energy and direction of the primary cosmic ray and the starting
altitude of the shower. The latter gives an idea of the identity of the primary.


CTAO/M.-A. Besel/IAC (G.P. Diaz) /ESO 


Primary photons or electrons cause an electromagnetic shower. The shower
electrons are relativistic and emit Cherenkov light when traversing the
atmosphere. The UV light follows the original direction of the primary particle,
and its intensity is proportional to the energy. Cherenkov light is reflected by
conventional mirrors. Currently operating telescope arrays to detect these
showers are H.E.S.S. in Namibia [530], MAGIC on the Canary Islands [554] and
VERITAS in Arizona [552]. The upper right-hand photograph shows a prototype of
the large telescopes for the Cherenkov Telescope Array, CTA [650]. This array,
currently under construction, will eventually comprise more than 100 telescopes
of three sizes in the northern and southern hemispheres, so that the whole sky
can be observed. An artist’s impression of the array is shown in the left-hand
picture.


Focus Box 10.5: Terrestrial cosmic ray and photon observatories
because of the Greisen-Zatsepin-Kuzmin limit. Efforts to associate the few
extreme energy cosmic rays observed so far with such sources have, however, 
had limited success [676, 679]. 

As mentioned in Section 10.1, the presence of light antimatter, positrons or
antiprotons, in cosmic rays may be a signal for the annihilation of dark matter
particles, e.g. via $\chi\chi \to e^+e^-$, or their decay, $\chi \to e^+/\bar{p} + \ldots$.
Matter and antimatter would of course be balanced in such processes, but the
contribution to matter fluxes will be swamped by ordinary astrophysical
processes. Astrophysical antimatter production is suppressed by the general
asymmetry between matter and antimatter in the Universe (see Section 10.3), so
a non-standard contribution would stick out more. And indeed, both antiprotons
[636, 673] and positrons [680, 681] show an excess with respect to what one
expects from secondaries coming from the interaction of ordinary cosmic rays.
The excess in positrons observed by the AMS cosmic ray observatory, taking data
on the International Space Station since 2011, is shown in Figure 10.4. Two
contributions are visible: one from conventional sources, ordinary cosmic rays
diffusing through the galaxy and producing positrons by interaction, and an
extra contribution taking over at an energy of some 20 GeV and cutting off at
close to 1 TeV. Both contributions are described by an ad-hoc analytical
formula, inspired by propagation models at low energies and bringing out the
main features of the unconventional contribution.

The origin of the unconventional contribution is not clear. Dark matter is
a valid candidate but not the only one [664]. Galactic pulsars [600] or other
astrophysical sources [623] have been proposed. Recent data from the HAWC
observatory [639] indicate that pulsars are not a likely source of the observed
excess. More data may allow us to find out the real origin of this fascinating
new source of cosmic antimatter.


10.3 DARK ENERGY 


We have seen in the preceding section that astronomical observation can point 
towards missing elements in the Standard Model. Now we call cosmology to 
the witness stand. Cosmology is the theory of the Universe as a whole and 
at first glance it has little to do with the physics of the smallest structures 
of matter and forces. However, particle physics has the outrageous ambition 
to describe everything, from beginning to end, from microcosm to the cosmos 
itself. So at some point there ought to be synergy when the extreme scales 
meet. And like all other physics, astrophysics and cosmology should become 
a branch of applied particle physics. 

But as long as there are giant gaps in the Standard Model the roles 
are rather reversed: cosmology serves to point out missing mechanisms and 
ingredients in our view of particles and their interactions. These are to be 
searched for by means of astrophysical observations, accelerator based and 
non-accelerator particle physics experiments. Complementarity between the 
two disciplines comes from the fact that large scale phenomena are dominated 
by gravity, usually negligible at very small scales. Let us start with a quick
fly-over of Big Bang cosmology. 

Observations indicate that our Universe was created some 14 billion
years ago in a Big Bang, out of a point-like singularity. Since then, it has
expanded under the influence of the total energy it was born with, plus known
and unknown forces. There are common misconceptions about the expansion
of the Universe, partially created by sloppy language used by scientists. The
expansion bears no resemblance to objects flying apart after an explosion.
Instead it is space itself which is expanding, such that even objects at rest
move apart. A two-dimensional model like the one shown in Figure 10.5 may
help your intuition. Imagine a rubber sheet of size $R \times R$ which is pulled
apart at a constant rate, such that both dimensions grow by a constant
percentage $\Delta R/R$ per lapse of time $\Delta t$. The upper figures show what
happens to a regular grid of points on the sheet. Now imagine that you sit at a
point on the sheet, like position 1 or 2 in the lower part of Figure 10.5, and
measure the recession speed of objects surrounding you. You notice that all
other points move away from you, none towards you, independent of where you sit.
And you notice that points further away recede at a higher speed than nearer
ones. This is due to the fact that $\dot{R} = \Delta R/\Delta t \propto R$. In
contrast to objects flying apart after an explosion, there is thus no
privileged point from which the expansion starts, no origin. And the speed of
recession is proportional to distance, regardless of where the observer is.
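
The rubber sheet argument can be checked numerically. The following is a minimal sketch, assuming a small square grid of points and a uniform stretching rate; the function name `recession_speeds`, the grid size and the rate of 0.1 per unit time are illustrative choices and not part of the model in Figure 10.5.

```python
import numpy as np

def recession_speeds(observer, rate=0.1, n=5):
    """Distances and recession speeds of all grid points seen from `observer`.

    `observer` is the index of a point on an n x n grid; `rate` is the
    fractional growth of every length per unit time (hypothetical value).
    """
    # Regular n x n grid of points, in arbitrary length units.
    xs, ys = np.meshgrid(np.arange(n, dtype=float), np.arange(n, dtype=float))
    points = np.column_stack([xs.ravel(), ys.ravel()])
    # Uniform stretching: every coordinate grows by the fraction `rate`.
    velocities = rate * points
    # Positions and velocities relative to the chosen observer.
    rel_pos = points - points[observer]
    rel_vel = velocities - velocities[observer]
    distance = np.linalg.norm(rel_pos, axis=1)
    speed = np.linalg.norm(rel_vel, axis=1)
    return distance, speed

if __name__ == "__main__":
    for observer in (0, 12):  # a corner and the centre of the 5 x 5 grid
        d, v = recession_speeds(observer)
        others = d > 0
        # The ratio speed/distance is the same for every point and observer.
        print(observer, np.allclose(v[others] / d[others], 0.1))
```

Whichever grid point is chosen as the observer, the ratio of speed to distance comes out equal to the stretching rate, which is the two-dimensional analogue of a recession speed proportional to distance without a privileged origin.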


Figure 10.5 A two-dimensional model 
This is what is happening to the Universe. Since the Big Bang, all lengths
have constantly grown. Indeed all lengths, including e.g. the wavelengths of
light. Thus light emitted at a certain time in the history of the Universe has
changed to a longer wavelength since then. The temperature of the Universe
has thus decreased over time; and conversely, it has been a very hot place in
the beginning. Expansion has since cooled it down to its current rather chilly
temperature of 2.7 K.

This view of our Universe is supported by a few basic observations:


• the red-shift of line spectra from far away galaxies, which leads to the
  Hubble-Lemaître law and deviations from it;

• the relative abundances of light elements synthesised during the first few
  moments of the Universe;

• the cosmic microwave radiation, a black body radiation indicating
  temperature, and its relation to the geometry of space-time and large scale
  structures.


We will discuss these pillars of modern cosmology one by one. 

The first observation has a complex history about which there is no
consensus among historians [603]. In any case, towards the end of the 1920s,
Georges Lemaître [220] and Edwin Hubble [231], mostly based on observations of
others, noticed a linear relation between the red-shift of spectral lines and
distance to the source, indicating that our Universe is expanding. The
observation and its relation to space expansion are explained in Focus Box 10.6.
Lemaître, a Belgian priest teaching at the Catholic University of Louvain,
published his findings in an obscure journal; when the English translation
appeared in 1931 [241], he deleted the concluding paragraphs about the
implications for an expanding Universe, but published them as a separate paper
[242]. Thus readers could not realise the precedence of Lemaître’s findings,
and Lemaître himself did not care. Hubble’s name stuck to the linear law until
the International Astronomical Union voted in 2018 to rename it the
Hubble-Lemaître law. General relativity allows one to understand the expansion
process, in terms of a homogeneous and isotropic Universe of constant density,
via the expansion equation first formulated by the Russian and Soviet physicist
Alexander Friedmann in 1922 [187]. A simple Newtonian reasoning [546] also leads
to the Friedmann equation, as Focus Box 10.7 shows.

Steven Weinberg has given an admirable description of the very early times
after the Big Bang [475]. Light nuclei were formed about one second into the
life of the Universe. The relative abundance of light nuclei can be understood
with a simple energy argument [522]. All short-lived particles had decayed by
that time; only electrons, neutrinos, photons, neutrons and protons populated
the very hot soup. Their relative numbers depended on the equilibrium of
weak interaction processes:

$$\nu_e\, n \leftrightarrow e^-\, p \; ; \quad \bar{\nu}_e\, p \leftrightarrow e^+\, n \; ; \quad n \leftrightarrow p\, e^-\, \bar{\nu}_e$$

Due to the expansion, the thermal energy became smaller and smaller. At some
point, it fell below the nucleon mass and nucleons became non-relativistic. At
sufficiently small temperatures, the reaction rates became too small to
maintain an equilibrium between the reactions converting neutrons into protons
The spectral lines from galaxies at cosmological distances, further away than
about 15 Mpc $\approx 4.6 \times 10^{23}$ m, are shifted towards longer
wavelengths. This is due to the Doppler effect caused by their recession with
respect to us, or any other observer in the Universe. The wavelength $\lambda'$
of a source moving with velocity $v/c$ with respect to the observer is shifted
with respect to the emitted wavelength $\lambda$ in the source rest system:

$$\frac{\lambda'}{\lambda} = \sqrt{\frac{1+v/c}{1-v/c}} = 1 + z$$

The second equality uses the red shift parameter $z = \Delta\lambda/\lambda$
often found in this context. One uses the red shift to measure the velocity of
recession. The optical magnitude of a cosmic object can serve as an estimator
of distance. The measured flux $F$ of photons diminishes as the square of the
distance such that a so-called luminosity distance, $D_L = \sqrt{L/(4\pi F)}$,
can be deduced if the total luminosity $L$ is known. For certain classes of
objects, like Cepheids or type-Ia supernovae, it can be reliably estimated.
What one finds is a linear dependence of the red shift on distance $r$, called
the Hubble-Lemaître law:

$$z \simeq \frac{v}{c} = \frac{H\,r}{c}$$

Recession velocities are thus proportional to distance, with a coefficient called
the Hubble constant $H$. This law corresponds to a uniform expansion of all
distances, including wavelengths, by a common factor $R(t)$, as demonstrated
in Focus Box 10.7. Thus a length $r_0$ measured today compares to a length
measured at another time $t$ like $r(t) = r_0 R(t)$. Without loss of generality
one can normalise all lengths to $r_0$; $R$ thus becomes a generic scale factor
applying to all lengths.
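
The relations used in this box can be tried out numerically. The sketch below assumes the relativistic Doppler formula for the wavelength shift, the inverse-square law for the luminosity distance and the linear law $v = H r$ in its low red shift form. All function names are hypothetical, and the default of 74 (km/s)/Mpc is simply the value of the Hubble constant quoted in this box.

```python
import math

C_KM_S = 299_792.458  # speed of light in km/s

def redshift_from_beta(beta):
    """Relativistic Doppler red shift z for a source receding with beta = v/c."""
    return math.sqrt((1.0 + beta) / (1.0 - beta)) - 1.0

def beta_from_redshift(z):
    """Inverse relation: recession velocity v/c from a measured red shift z."""
    s = (1.0 + z) ** 2
    return (s - 1.0) / (s + 1.0)

def luminosity_distance(luminosity, flux):
    """D_L = sqrt(L / (4 pi F)); the unit of D_L follows from those of L and F."""
    return math.sqrt(luminosity / (4.0 * math.pi * flux))

def hubble_distance_mpc(z, h0=74.0):
    """Distance r = c z / H in Mpc, valid for small z; h0 in (km/s)/Mpc."""
    return C_KM_S * z / h0

if __name__ == "__main__":
    z = redshift_from_beta(0.01)          # source receding at 1% of c
    print(z, beta_from_redshift(z))       # z is close to v/c at low velocity
    print(hubble_distance_mpc(z))         # corresponding distance in Mpc
```

At low velocities the red shift is very nearly $v/c$, which is why the linear law can be read either as a relation between red shift and distance or between recession velocity and distance.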

Hubble’s constant is given by $H = \dot{R}/R$. Contrary to its name, it is not a
constant, because of the decelerating effect of gravity and the acceleration
by dark energy (see Focus Box 10.7). The Hubble Space Telescope key
project [527] finds a current value of $H_0 = 74.0 \pm 1.4$ (km/s)/Mpc from
relatively near-by sources [667, 668]. Results derived from the cosmic microwave
background, and thus from the early Universe, appear to give a lower value [684].
For an up-to-date review of cosmological parameters, see:

http://pdg.lbl.gov/2019/reviews/rpp2019-rev-cosmological-parameters.pdf.


Focus Box 10.6: The Hubble-Lemaître law
The Friedmann equation for the large scale evolution of the Universe assumes
a homogeneous and isotropic mass distribution characterised by the mass
density $\rho$. A point mass $m$ at a large distance $R$ from Earth will feel the
attraction of a mass $M = \frac{4\pi}{3} R^3 \rho$. Its Newtonian equation of
motion will thus be $m\ddot{R} = -G_N M m/R^2$ with Newton’s gravitational
constant $G_N$. Integration gives:

$$\frac{1}{2} m \dot{R}^2 - \frac{G_N M m}{R} = \mathrm{const} = -\frac{1}{2} K m c^2$$

The integration constant $K$ is introduced such that it is dimensionless. The
two terms on the left are the kinetic and potential energy of our point mass,
so the term on the right is the total energy. For $K = -1$, it is positive and
expansion continues forever with the speed of light as its asymptotic velocity.
$K = +1$ corresponds to a deficit of total energy; expansion will stop and
reverse at some point. The very special case $K = 0$ corresponds to a perfect
balance between kinetic and potential energy, with the expansion velocity
tending towards zero when the kinetic energy is used up. The Universe will
end up in a motionless nirvana. This limiting case defines a just closed
Universe.

As Einstein realised, one can add a further term, with a so-called cosmological
constant $\Lambda$, without violating general relativity. One thus obtains the
complete Friedmann equation:

$$H^2 = \left(\frac{\dot{R}}{R}\right)^2 = \frac{8\pi G_N \rho}{3} - \frac{K c^2}{R^2} + \frac{\Lambda}{3}$$

The term $\propto \Lambda$ describes a constant curvature of space that the
Universe was either born with or which developed dynamically during its history.
Integrating the Friedmann equation for the simplest case $K = \Lambda = 0$ in the
non-relativistic regime, one obtains
$R(t) = \left(\frac{9}{2} G_N M\right)^{1/3} t^{2/3}$. The Hubble constant $H_0$
found today is thus related to the age of the Universe by
$H_0 = \dot{R}/R = \frac{2}{3 t_0}$. With this, one finds a rather short age of
$t_0 = (9.4 \pm 1.3)$ Gy, much below the age of the oldest measured cosmic
objects, estimated to be about 14 Gy. Either $K < 0$ or a non-zero value of the
cosmological constant $\Lambda$ would increase the predicted age.
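
For readers who want to follow the integration step by step, here is a sketch of the calculation, assuming the Newtonian energy balance of this box with $K = \Lambda = 0$, a constant enclosed mass $M$ and the initial condition $R(0) = 0$:

```latex
\begin{align}
\dot{R}^2 &= \frac{2 G_N M}{R}
  && \text{(energy balance with } K = \Lambda = 0\text{)} \\
\sqrt{R}\,\mathrm{d}R &= \sqrt{2 G_N M}\,\mathrm{d}t
  && \text{(separation of variables)} \\
\tfrac{2}{3} R^{3/2} &= \sqrt{2 G_N M}\;t
  && \text{(integration with } R(0) = 0\text{)} \\
R(t) &= \left(\tfrac{9}{2} G_N M\right)^{1/3} t^{2/3} \\
H &= \frac{\dot{R}}{R} = \frac{2}{3t}
\end{align}
```

The last line shows why, in this simplest case, the age of the Universe is two thirds of the inverse Hubble constant.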
Again for $K = \Lambda = 0$, the Friedmann equation yields a critical density
$\rho_c$ for which the Universe is just closed:

$$\rho_c = \frac{3 H^2}{8\pi G_N}$$

One can measure all observed densities relative to this critical one and sum
them up, to see if the closure condition is indeed satisfied or even exceeded.
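
As a numerical illustration, the sketch below evaluates the age $t_0 = 2/(3H_0)$ and the critical density $\rho_c = 3H_0^2/(8\pi G_N)$ for the simplest case $K = \Lambda = 0$. The function names and the rounded constants are assumptions made for this example; the input of 74 (km/s)/Mpc is the value quoted in Focus Box 10.6.

```python
import math

G_N = 6.674e-11          # Newton's constant in m^3 kg^-1 s^-2
MPC_IN_M = 3.0857e22     # metres per megaparsec
YEAR_IN_S = 3.156e7      # seconds per year
M_HYDROGEN = 1.674e-27   # mass of a hydrogen atom in kg

def hubble_si(h0_km_s_mpc):
    """Convert a Hubble constant from (km/s)/Mpc to 1/s."""
    return h0_km_s_mpc * 1.0e3 / MPC_IN_M

def matter_only_age_gy(h0_km_s_mpc):
    """Age t0 = 2 / (3 H0) in units of 10^9 years, valid for K = Lambda = 0."""
    return 2.0 / (3.0 * hubble_si(h0_km_s_mpc)) / YEAR_IN_S / 1.0e9

def critical_density(h0_km_s_mpc):
    """Critical density 3 H0^2 / (8 pi G_N) in kg/m^3."""
    return 3.0 * hubble_si(h0_km_s_mpc) ** 2 / (8.0 * math.pi * G_N)

if __name__ == "__main__":
    h0 = 74.0
    print(matter_only_age_gy(h0))              # age in Gy
    rho_c = critical_density(h0)
    print(rho_c, rho_c / M_HYDROGEN)           # kg/m^3 and H atoms per m^3
```

With this input the age comes out close to 9 Gy, well below the roughly 14 Gy of the oldest observed objects, and the critical density is of the order of $10^{-26}$ kg/m$^3$, a handful of hydrogen atoms per cubic metre.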


Focus Box 10.7: The Friedmann equation 
(with a gain in energy) and protons into neutrons (requiring kinetic energy).
The two baryons thus fell out of equilibrium, and neutrons started to decay
freely, increasing the number of protons. At the same time, deuteron bound
states started to be formed. For a while, formation and dissociation stayed
in equilibrium via the reaction $\gamma\,{}^2\mathrm{H} \leftrightarrow p\,n$.
When the temperature fell below the threshold, deuterons became stable and
helium synthesis started. The last light nucleus formed in this chain process,
called primordial nucleosynthesis, is lithium ${}^7\mathrm{Li}$. Heavier nuclei
are synthesised in stars by fusion. We give a quantitative glimpse of primordial
nucleosynthesis in Focus Box 10.8. The result is that the number of helium
nuclei relative to hydrogen is predicted to be about 8%, as indeed observed in
the Milky Way and close to what is found in cosmic rays (see Section 10.2). The
hot beginning of the Universe is thus confirmed.

The total abundance of nuclei corresponds to a total baryonic mass density
of $\rho_B = (3.0 \pm 1.5) \times 10^{-28}$ kg/m$^3$ or a number density of
$N_B = (0.18 \pm 0.09)$/m$^3$. Comparing to the number density $N_\gamma$ of
photons found in the Cosmic Microwave Background, discussed below, one finds:

$$\frac{N_B}{N_\gamma} \simeq (4 \pm 2) \times 10^{-10}$$

The numbers were of course comparable while the thermal equilibrium
between radiation and matter was still intact. The vast majority of baryons
has thus disappeared, leaving baryons in a minority of a few in ten billion
photons.
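
The order of magnitude of this ratio can be reproduced with a few lines of code. The sketch below assumes the standard black body result for the photon number density, $N_\gamma = (2\zeta(3)/\pi^2)\,(kT/\hbar c)^3$, which is not derived in the text, a temperature of 2.725 K and the baryonic mass density quoted above; the function names are hypothetical.

```python
import math

K_B = 1.380649e-23      # Boltzmann constant in J/K
HBAR = 1.054572e-34     # reduced Planck constant in J s
C = 2.99792458e8        # speed of light in m/s
M_PROTON = 1.6726e-27   # proton mass in kg
ZETA_3 = 1.2020569      # Riemann zeta function at 3

def photon_number_density(temperature):
    """Photons per m^3 in black body radiation of the given temperature (K)."""
    inverse_length = K_B * temperature / (HBAR * C)
    return 2.0 * ZETA_3 / math.pi ** 2 * inverse_length ** 3

def baryon_number_density(rho_b):
    """Baryons per m^3 for a baryonic mass density rho_b in kg/m^3."""
    return rho_b / M_PROTON

if __name__ == "__main__":
    n_gamma = photon_number_density(2.725)
    n_b = baryon_number_density(3.0e-28)
    print(n_gamma, n_b, n_b / n_gamma)
```

The photon density comes out at a few hundred million per cubic metre, so that the ratio of baryons to photons is indeed a few times $10^{-10}$.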

In the equilibrium period, the reaction $p\bar{p} \leftrightarrow \gamma\gamma$
also guaranteed an equilibrium between matter and antimatter. One would thus
expect an almost total annihilation between matter and antimatter, occurring
after the temperature fell below the threshold required to maintain this
equilibrium. Today, only an even tinier but equal fraction of matter and
antimatter, $N_B/N_\gamma = N_{\bar{B}}/N_\gamma \simeq 10^{-18}$, ought to have
survived.

This is evidently not the case, otherwise we would not exist. One observes 
a matter fraction eight orders of magnitude larger and almost no antimatter. 
Man-made objects have visited almost all objects in the solar system without
exploding in matter-antimatter annihilation; there are thus no close-by
anti-planets. In fact the Milky Way and its supercluster are also made of matter.
In cosmic rays, the abundance of antiprotons is compatible with secondary 
production by matter cosmic rays, up to a small excess [636, 673]. Heavier 
antimatter has not yet been convincingly observed, with current limits on 
the ratio Ng>/Nue = 10~"[588]. The only other antimatter which has been 
observed are positrons, which we discussed in Section 10.1. 

There are two ways to reconcile the apparent absence of antimatter and 
the principles of a hot Big Bang, symmetric between matter and antimatter. 
There could be an as yet unknown mechanism which has segregated matter 
and antimatter into separate portions of the Universe, and we happen to live in 
a matter dominated region. In that case, at some point an antihelium nucleus 
When the thermal energy of nucleons, $kT$ with Boltzmann constant $k$ and
temperature $T$, becomes smaller than their mass, $M_N$, they become
non-relativistic. Their relative number is given by the factor
$N_n/N_p = e^{-Q/kT}$, with $Q = M_n - M_p = 1.3$ MeV. At sufficiently low
temperatures, $kT \simeq 0.87$ MeV, the rate of inverse reactions re-creating
protons out of neutrons, which maintains the equilibrium between the two
species, becomes too low and the two populations decouple from each other.
Thermal thresholds are of course not very sharp because of the long tail of the
Maxwell-Boltzmann distribution towards high energies.

The relative number of nucleons at that point was
$N_n(0)/N_p(0) = e^{-Q/kT} \simeq 0.23$, and neutrons started to decay. At time
$t$ there still were $N_n(t) = N_n(0)\,e^{-t/\tau_n}$, while the number of
protons increased to $N_p(t) = N_p(0) + N_n(0)\,(1 - e^{-t/\tau_n})$, with the
lifetime of free neutrons, $\tau_n \simeq 880$ s. The ratio of neutrons to
protons thus evolved like
$N_n(t)/N_p(t) = 0.23\,e^{-t/\tau_n} / (1.23 - 0.23\,e^{-t/\tau_n})$. If nothing
else had happened, neutrons would have exponentially disappeared from the
Universe and only protons would be left.

But protons and neutrons form stable nuclei, and inside those the neutron
lifetime is no longer limited. Nucleosynthesis starts with deuteron formation,
$n\,p \to \gamma\,{}^2\mathrm{H}$. The binding energy is very large,
$E_b \simeq 2.2$ MeV, so is the cross section for this process,
$\sigma \simeq 0.1$ mb. The formation process thus stayed in equilibrium with
the inverse photo-disintegration, down to a temperature of about
$kT \simeq 0.05$ MeV. This threshold crossed, ${}^2\mathrm{H}$ became stable and
helium synthesis started:

$$
\begin{aligned}
{}^2\mathrm{H}\,n &\to {}^3\mathrm{H}\,\gamma \; ; & {}^2\mathrm{H}\,p &\to {}^3\mathrm{He}\,\gamma \\
{}^3\mathrm{H}\,p &\to {}^4\mathrm{He}\,\gamma \; ; & {}^3\mathrm{He}\,n &\to {}^4\mathrm{He}\,\gamma
\end{aligned}
$$


At this moment, $t \simeq 400$ s, the ratio of neutrons to protons was
$r = N_n(t)/N_p(t) = 0.14$. Since one needs two protons and two neutrons to
synthesise a ${}^4\mathrm{He}$, the helium share of the total nuclear mass
turned out to be (with $M_{He} \simeq 4M_p$, $N_p = N_H + 2N_{He}$,
$N_n = 2N_{He}$):

$$\frac{M_{He} N_{He}}{M_{He} N_{He} + M_p N_H} = \frac{2r}{1+r} \simeq 0.25$$

Experimentally one finds indeed a mass ratio of $0.24 \pm 0.1$ at the level of
our Milky Way [509], and $N_{He}/N_H$ about 8%. This agreement between prediction
and observation is one of the big triumphs of the Big Bang model. In the
same vein, the abundances of ${}^2\mathrm{H}$, ${}^3\mathrm{He}$ and
${}^7\mathrm{Li}$ are predicted correctly. They exceed significantly their
abundances inside stars, and indeed come mostly from primordial nucleosynthesis.
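
The chain of estimates in this box can be followed numerically. The sketch below assumes the freeze-out ratio $e^{-Q/kT}$, free neutron decay with a lifetime of 880 s and the simplification that all surviving neutrons end up in helium-4; the helium formulas $2r/(1+r)$ for the mass share and $(r/2)/(1-r)$ for the number ratio follow from that counting. Function names and the chosen time of 400 s are illustrative.

```python
import math

Q_MEV = 1.3            # neutron-proton mass difference in MeV
KT_FREEZE_MEV = 0.87   # decoupling temperature in MeV
TAU_N_S = 880.0        # lifetime of free neutrons in s

def ratio_at_freeze_out(q=Q_MEV, kt=KT_FREEZE_MEV):
    """Neutron-to-proton ratio when the two populations decouple."""
    return math.exp(-q / kt)

def ratio_after_decay(t, r0, tau=TAU_N_S):
    """Neutron-to-proton ratio after free decay for a time t (seconds)."""
    surviving = math.exp(-t / tau)
    neutrons = r0 * surviving                 # per initial proton
    protons = 1.0 + r0 * (1.0 - surviving)    # decayed neutrons become protons
    return neutrons / protons

def helium_mass_fraction(r):
    """Helium share of the nuclear mass if all neutrons end up in helium-4."""
    return 2.0 * r / (1.0 + r)

def helium_to_hydrogen_number(r):
    """Number of helium nuclei per hydrogen nucleus under the same assumption."""
    return 0.5 * r / (1.0 - r)

if __name__ == "__main__":
    r0 = ratio_at_freeze_out()
    r = ratio_after_decay(400.0, r0)
    print(r0, r)
    print(helium_mass_fraction(0.14), helium_to_hydrogen_number(0.14))
```

With the rounded inputs used here, the ratio after 400 s comes out slightly below the value of 0.14 quoted in the box; inserting 0.14 directly gives a helium mass share close to one quarter and about 8 helium nuclei per 100 hydrogen nuclei.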


Focus Box 10.8: Primordial nucleosynthesis 
might diffuse into the solar system and be observed by a cosmic ray
spectrometer like AMS. Even more exciting, if one found a heavier antinucleus,
e.g. anticarbon or antioxygen, one would prove the existence of antistars
somewhere in our galaxy, since nuclei heavier than lithium are synthesised in
stars. As unlikely as this might be, the importance of such a possibility
justifies the experimental effort.

The second possibility is that all primordial antimatter has disappeared 
during the history of the Universe. In 1967, Andrei Sakharov, one of the 
fathers of the Soviet nuclear bomb program (see Section 6.5) and laureate of 
the Nobel peace prize in 1975, formulated the necessary conditions for making 
antimatter vanish dynamically [395]: 


• There must be an interaction which violates the conservation of baryon
  number.

• In order to tell this interaction the difference between matter and
  antimatter, the symmetry between them must be violated.

• The disappearance must occur during a period of non-equilibrium, such
  that antimatter is not recreated by an inverse reaction.


Among these three conditions, only the second has so far been experimentally 
established by observing the violation of CP symmetry (see Section 9.2) at 
a very low level. If cosmic ray experiments continue to tighten the limits 
for surviving antimatter, the fascinating mechanism of baryon number non- 
conservation must exist. Theoretically conceivable but without experimental 
support, this clearly challenges experimental particle physics. 

The third supporting evidence for the Big Bang origin of the Universe
comes from the so-called cosmic microwave background (CMB). This radiation
was discovered by Arno Penzias and Robert Wilson in 1965 [387]. They
worked at Bell Labs, a private research establishment with no fewer than nine
Nobel laureates among its collaborators. The original intention was to detect
radio waves reflected off Echo balloon satellites for communication. They
constructed a 6 m horn antenna, with a receiver cooled down to liquid helium
temperatures, 4 K, so that the faint signal from the balloon would be
recognisable. Despite all efforts to reduce noise, an irreducible random signal
remained. It is a constant radiation from the cosmos in the microwave band,
which they detected at a wavelength of 7.35 cm. Its wavelength distribution
corresponds to that of a perfect black body (see Section 5.1) with a
temperature of about 2.725 K.

The origin of this radiation is the time when the Universe passed from
an opaque to a transparent state [397, 400]. Before this time, photons and
atoms were in equilibrium via the reaction
$\gamma\,\mathrm{H} \leftrightarrow e^- p$ and photons could not penetrate very
far. When the energy of photons fell below the ionisation energy of hydrogen,
some 390,000 years after the Big Bang, hydrogen atoms became stable and the
photon radiation decoupled from matter. What we see today as cosmic microwave
background is the relic of this radiation, cooled down to today’s temperature
by the expansion of the Universe. Photons from the CMB are the oldest
electromagnetic radiation which we can observe.

Since the CMB is the ideal black body radiation, it maps the temperature 
distribution of the early Universe. The satellites COBE and WMAP of NASA, 
as well as the Planck satellite of ESA have done so with ever increasing angu- 
lar and temperature resolution. The two leading scientists of the pioneering 
COBE mission, George Smoot and John Mather, received the Nobel prize in 
physics in 2006 for their work on the project. The temperature turns out to 
be surprisingly homogeneous, globally 2.725K with fluctuations of only 1 part 
in 100,000. This is hard to understand, since there are regions in the sky far 
enough apart so that they were never in thermal contact for an expansion 
with less than the speed of light. There must thus have been a period of more 
rapid expansion, the so-called inflation period. During this period the Uni- 
verse expanded with more than the speed of light. There is no contradiction 
to special relativity, since it is space itself that expands. 

The angular structure of the tiny temperature fluctuations can be used to
measure the curvature of space-time, since curvature acts just like a
gravitational lens. Roughly speaking, the curvature is measured by the size of
cold or hot spots in microwave background temperature maps. The larger the
curvature the smaller the physical scale of the spots. The most important result
is probably that the observable Universe has no overall curvature; its geometry
is flat with percent precision [602]. There is of course local curvature
transmitting gravity, but here we are talking about global features, at
distances much larger than a galaxy cluster, characterising the observable
Universe as a whole. Flat space means that the global geometry is Euclidean:
the three angles in any triangle⁶ sum up to 180°. The clumps in the distribution
of large visible structures in the Universe corroborate this finding [566].

This means that the overall matter and energy density must be exceedingly
close, if not identical, to the critical density defined in Focus Box 10.7, of the
order of $10^{-26}$ kg/m$^3$ or roughly 10 hydrogen atoms per cubic metre. Since
the matter density, luminous and dark taken together, is no more than about
32% of the critical density [602], a dominating component must be missing.
It was identified in the late 1990s as dark energy.

Dark energy is again detected using the relation between recession speed
and distance. It turns out that for very distant sources, the recession velocity
is less than expected from the linear relation found at small distances
[502, 516]. Since photons from far away sources were emitted earlier, this means
that the Universe is now expanding faster than in the past. For this discovery,
Saul Perlmutter, Adam Riess and Brian Schmidt received the 2011 Nobel prize in
physics. A few more details on the evidence are given in Focus Box 10.9. For a
more in-depth discussion, I recommend Ruth Durrer’s article [583]. A
comprehensive review of observational methods is given in [596].


⁶ If the curvature were positive, the total density would be larger than the
critical one, and angles in a triangle would sum up to more than 180° like in
spherical geometry. If it were negative, the total density would fall short of
the critical one and angles would sum up to less than 180° in a hyperbolic
geometry.

Since the gravitational force is attractive, one would assume that the
expansion of the Universe slows down as a function of time, but the opposite
seems to be the case. There ought to be a counteracting form of energy
that supplies a repulsive force; it is called dark energy because again it is not
subject to other forces. The cosmological constant $\Lambda$ in the Friedmann
equation would represent such an energy. It would mean that space-time was born
with the necessary curvature. Cosmology with this constant as well as cold
dark matter, abbreviated $\Lambda$CDM, is often considered the Standard Model of
cosmology.

Alternatively, new scalar fields have been proposed to generate such an 
effect dynamically. So far, measurements of recession speed as a function of 
distance have stayed the only indication for the existence of dark energy and 
it is difficult to find more telling data [570]. 

While the flatness of space-time requires an additional energy density of 
non-standard origin, it is not unanimously accepted that an acceleration of 
space-time expansion exists or is due to an evenly distributed dark energy. 
The astronomers and theorists around Subir Sarkar of Oxford University, for 
example, have re-examined the evidence for an accelerated expansion with a 
large sample of type-Ia supernovae [634]. They argued that the significance 
for an observed effect is much less than previously thought. More recently,
Sarkar and others also observed evidence for an anisotropy in the accelera- 
tion [675], roughly aligned with the dipolar anisotropy in the cosmic microwave 
background. This could indicate a peculiar local flow of our reference system. 
Thus some additional energy density ought to exist to make space-time flat, 
but it is unclear if cosmic acceleration has to do with it. What stays a matter 
of fact is that most of the Universe is dark. 

Like in the case of dark matter, one can alternatively invoke modifications 
of the laws of gravity to explain a possible acceleration of the Universe’s 
expansion without dark energy. The ways to do this and the experimental 
methods which may distinguish between dark energy and modified gravity 
are beyond the scope of this book. An excellent review has been recently 
presented by the Theory Working Group for the Euclid space mission [655], 
ESA’s major future project to look into the expanding Universe. 

Gravity thus plays a major role wherever the Standard Model of particle 
physics is known to be incomplete. Finally understanding this interaction, 
which men and women have known the longest, is a clear priority. 


10.4 QUANTUM GRAVITY 


Do we need a quantum theory of gravity? Can we not just live with its coverage
by general relativity, as outlined in Section 3.5, summarised by Wheeler’s
slogan [409]: “Mass tells space-time how to curve, and space-time tells mass how
to move”? In general relativity, no particles are exchanged; the gravitational
The luminosity distance, $D_L$, is related to the astronomical magnitude $m$, a
conventional measure of brightness, by the relation
$m(z) = 5 \log_{10} D_L + 25$. At non-relativistic velocities, the red shift is
$z \simeq v/c$. For a Hubble constant which is independent of time,
$z \propto D_L$, one would thus expect a relation between magnitude and red
shift of the form $m(z) = a + b \log_{10} z$. If one samples a standard light
source, with constant emitted luminosity, one can plot magnitudes against red
shifts and test this hypothesis. Such a standard sample is supposedly given by
type-Ia supernovae, thermonuclear explosions of white dwarf stars in binary
systems. Their explosion releases a well defined amount of energy, such that
its luminosity can be used to derive an astronomical standard candle with
appropriate corrections.
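
To see where the coefficients $a$ and $b$ come from, one can insert the linear law into the magnitude relation. The sketch below assumes that $D_L$ is measured in Mpc, as is conventional for this form of the relation, that the linear law reads $D_L = cz/H_0$, and that the intrinsic brightness of the standard candle is absorbed into the offset; the function names and the default of 74 (km/s)/Mpc are choices made for this example.

```python
import math

C_KM_S = 299_792.458  # speed of light in km/s

def magnitude(z, h0=74.0):
    """m = 5 log10(D_L) + 25 with D_L = c z / H0 in Mpc (constant H0 assumed)."""
    d_l_mpc = C_KM_S * z / h0
    return 5.0 * math.log10(d_l_mpc) + 25.0

def linear_coefficients(h0=74.0):
    """Coefficients (a, b) of m = a + b log10(z) for a time-independent H0."""
    a = 25.0 + 5.0 * math.log10(C_KM_S / h0)
    b = 5.0
    return a, b

if __name__ == "__main__":
    a, b = linear_coefficients()
    for z in (0.01, 0.05, 0.1):
        # Both columns agree: the relation is a straight line in log10(z).
        print(z, magnitude(z), a + b * math.log10(z))
```

Under these assumptions the slope is fixed at $b = 5$, while the offset $a$ depends on the Hubble constant. Deviations of distant supernovae from this straight line are what signals a Hubble constant that changes with time.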


© AAS. Reproduced with permission. The figure on the left [516] shows
At the same time, the light from these sources took longer to arrive than that
from closer sources (empty dots) which lie on the Hubble-Lemaître line. Thus
recession speed is faster today than in the past. The data are compatible
with domination by dark energy (uppermost dashed curve).


According to these data, the global energy density of the Universe is composed
of about 5% ordinary matter, 27% dark matter and 68% dark energy. The total
density is very close to the critical one. Dark energy is very much diluted,
with a density of the order of $1.67 \times 10^{-27}$ kg/m$^3$, such that the
solar system up to Pluto contains only about 6 tons. But its distribution is
supposedly uniform, such that it dominates evolution at cosmic scales.

[Pie chart: shares of ordinary matter, dark matter and dark energy.]
Credit: ESA Science and Technology


Focus Box 10.9: Dark energy 
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field is the metric of space-time itself. Macroscopic measurements confirm 
the validity of general relativity [556], including tools that you use every day 
like the Global Positioning System [541]. After all, if quantum effects for this 
exceedingly weak interaction exist, they are only relevant when we approach 
the extreme distance of the Planck scale⁷, $l_P = \sqrt{\hbar G/c^3} \approx 1.6 \times 10^{-35}$ m,
where all interactions are of comparable strength. We can obviously safely 
ignore the practical consequences of gravity at any energy reachable by even 
the most powerful accelerators. But why should gravity as the only force not 
be quantised? And why should we refrain from attempting to unify its descrip- 
tion with that of the other elementary forces? 
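
To make the orders of magnitude concrete, the short sketch below evaluates the Planck length $\sqrt{\hbar G/c^3}$ from CODATA constants. As an addition that is not part of the text, it also evaluates the corresponding Planck energy $\sqrt{\hbar c^5/G}$ and compares it with a collider energy. The reference energy of 13.6 TeV (LHC proton-proton collisions) and all variable names are assumptions chosen for illustration.

```python
import math

# CODATA values in SI units
HBAR = 1.054571817e-34           # reduced Planck constant, J s
G = 6.67430e-11                  # gravitational constant, m^3 kg^-1 s^-2
C = 299_792_458.0                # speed of light, m/s
JOULE_PER_GEV = 1.602176634e-10  # conversion factor, J per GeV

# Planck length in metres
planck_length = math.sqrt(HBAR * G / C**3)

# Planck energy in GeV (added for comparison, not quoted in the text)
planck_energy_gev = math.sqrt(HBAR * C**5 / G) / JOULE_PER_GEV

# ASSUMED reference energy: 13.6 TeV, expressed in GeV
E_COLLIDER_GEV = 13.6e3
ratio = E_COLLIDER_GEV / planck_energy_gev

print(f"Planck length ~ {planck_length:.2e} m")
print(f"Planck energy ~ {planck_energy_gev:.2e} GeV")
print(f"E_collider / E_Planck ~ {ratio:.1e}")
```

With these inputs the collider energy falls short of the Planck energy by roughly fifteen orders of magnitude, which is the sense in which gravity can be ignored at accelerator energies.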

There are two complementary approaches to quantum gravity, which relate 
to these two questions differently. The first one, very much in line with the 
arguments we have presented in this book so far, treats gravity as yet another 
force. It tries to construct a quantum “theory of everything” on curved space- 
time. Gravity would then be transmitted by fluctuations of the metric over 
a flat four-dimensional Minkowski background metric, the one we used to 
describe the other Standard Model forces. The direct implementation of this 
idea suffers from the fact that the gravitational constant G is not
dimensionless (see Section 2.1). In natural units it has the dimension of an
inverse squared energy, leading to infinities. The theory is not perturbatively
renormalisable in the sense we superficially discussed in Section 9.2; it makes
no predictions when using the usual tools. A successful implementation requires
eliminating infinities
in a different way. If particles are not point-like objects, but extended one- 
dimensional structures called strings, non-renormalisable infinities disappear. 
String theory [677] sees particles as excitations of these strings, including a 
tensor-like particle as the potential quantum of gravity, the graviton. String 
theory requires extra dimensions beyond the four of Minkowski space; we 
would then live on a four-dimensional surface of this space. Continuity of the 
probability density requires the extra dimensions to be compact: a detour of 
the fields through an extra dimension would come back to the same amplitude 
and phase. Otherwise particles could disappear somewhere and reappear in a 
different space-time point. 

Gravitons, the quanta of such a field theory of gravity, are exceedingly dif- 
ficult if not impossible to detect. One should not confuse the recently detected 
gravitational waves with the action of gravitons. These waves relate to
gravitons like an intense laser beam to a single photon⁸. And the waves do exist. In 1982,
Joseph Taylor Jr. and Joel M. Weisberg found that the orbit of a pulsar 
gyrating around a neutron star was slowly shrinking due to energy loss by 
gravitational waves [440]. The waves themselves were first detected in 2015 by 
the twin Laser Interferometer Gravitational-wave Observatory (LIGO) detec- 
tors, located in Livingston, Louisiana, and Hanford, Washington, USA as well 


⁷ A simple argument to derive this scale is found in Sabine Hossenfelder’s review of the
experimental search for quantum gravity [585].

⁸ The number of photons emitted by a 1 mW laser pointer is of the order of $10^{15}$ per
second.

as the Virgo interferometer near Pisa, Italy [630]. Given the incredible
smallness of the effect, this is an admirable experimental breakthrough all by itself.
However, if gravitons exist, gravitational waves correspond to immense num- 
bers of them, 10°°/v? according to one estimate [565], with v the frequency 
of the wave. Single gravitons, to be detected in analogy to the photoelectric 
effect or Compton scattering for photons, have an incredibly small cross sec- 
tion. Even a detector of the size of Jupiter would not be able to detect a 
single graviton from conceivable astrophysical sources [565]. This led Freeman 
Dyson to argue that a graviton may be undetectable in principle [597]. He
writes [559, p. 221]⁹:


According to my hypothesis, the gravitational field described 
by Einstein’s theory is a purely classical field without any quantum 
behavior. Gravitational waves exist, and can be detected, but they 
are classical waves and not collections of gravitons. If this hypoth- 
esis is true, we have two separate worlds, the classical world of 
gravitation and the quantum world of atoms, described by sepa- 
rate theories. The two theories are mathematically different and 
cannot be applied simultaneously. But no inconsistency can arise 
from using such theories, because any differences between their 
predictions are physically undetectable. 


This would mean abandoning the reductionist picture of Figure 1 and,
reluctantly, the dream of a final unified theory. But all may not be lost for
experimentalists: effects of quantum gravity other than the detection of single
gravitons may be observable, especially in the vicinity of black holes [585].

A complementary theoretical approach does not try to unify forces but 
takes general relativity more seriously. Its proponents argue that space-time 
may itself be quantised. Gravity thus stays a property of space-time and is not 
just another force. This approach is represented by loop quantum gravity [508]. 
Here quantisation means that space and time themselves are quantised, that 
there is a minimum distance. At dimensions comparable to the Planck length, 
space and time are then a network with an extremely fine mesh made of
so-called loops¹⁰. As a consequence, surface and volume are also quantised and
divergences coming from infinite densities disappear; the minimum distance
provides a natural cut-off for ultraviolet divergences. There is thus no need for a
perturbative approach, either for gravity or for its matter couplings. Space-time
is a dynamical object of the theory, with no need of a fixed background metric. 
I admit that the mathematics of this fascinating approach is not within my 
reach and refrain from venturing further. 

What needs attention, though, is the different role of time in the two 
approaches sketched above. In the field theoretical approach, time keeps its 
role as the principal parameter of system evolution. Matter and interactions


⁹ © F. Dyson, with permission of New York Review Books.
¹⁰ Please do not confuse these loops of space-time with the particle loops in vacuum
polarisation, shown in Figures 9.12 and 9.13.

evolve in space as a function of time. In the loop approach to quantum gravity,
time loses that special role. In fact Carlo Rovelli argues that it disappears from 
consideration [660]. It has no unity, since it is different for every observer; no 
present which can be unambiguously defined; and no direction since past and 
future are equivalent, CP violation notwithstanding. It is not a parameter, but
an aspect of a dynamical field. And it is intrinsically ours, a human concept. 

It is thus entirely possible that the reductionist diagram, Figure 1 in
Chapter 1, which served as a guideline throughout much of this book, has a basic
flaw. If gravity is not a force like the others, it will not join the Standard Model 
of particle physics but stay a property of space-time, probably together with 
dark energy. Maybe the whole reductionist approach is facing its limits and 
I will at some point have to reluctantly join Freeman Dyson in his harsh 
judgement [559, p. 13]¹¹:


A reductionist philosophy, arbitrarily proclaiming that the 
growth of understanding must go only in one direction, makes no 
scientific sense. Indeed, dogmatic philosophical beliefs of any kind 
have no place in science.